Optimization of cancer treatment with irinotecan

ABSTRACT

Various embodiments of the invention include methods and compositions for evaluating the risk of irinotecan toxicity in a patient. In certain embodiments, the methods include detecting a promoter polymorphism in one or both UGT1A1 genes of the patient. In particular embodiments the promoter polymorphism is a single nucleotide polymorphism and may be in linkage disequilibrium with a UGT1A1 (TA) n  repeat. The methods may include obtaining a nucleic acid sample from the patient and detecting the presence or absence of a promoter polymorphism. The promoter polymorphism may be at nucleotide position −3440, −3401, −3279, −3177, −3175, or −3156 from the UGT1A1 gene transcriptional start site. The number of TA repeats can be 5, 6, 7, 8 more TA repeats. In particular embodiments, the promoter polymorphism is a −3440C&gt;A, −3401T&gt;C, −3279G&gt;T, −3177C&gt;G, −3175A&gt;G, −3156G&gt;A polymorphism or any combination thereof. Moreover, in other embodiments, identification of a guanine residue at position −3156 indicates the patient does not have a low level of UGT1A1 activity, and therefore, methods and compositions of the invention concern this identification.

[0001] This application claims priority to U.S. Provisional Patent Application No. 60/437,928 filed on Jan. 3, 2003, U.S. Provisional Patent Application No. 60/446,942 filed on Feb. 12, 2003, and U.S. Provisional Patent Application No. 60/474,826 filed on May 30, 2003, all of which are incorporated by reference in their entireties.

[0002] The government may own rights in the present invention pursuant to grant number GM61393 from the National Institutes of Health.

BACKGROUND OF THE INVENTION

[0003] 1. Field of the Invention

[0004] The present invention relates generally to the fields of molecular genetics, pharmacogenetics, and cancer therapy. In particular, the present invention is directed to methods and compositions for detecting polymorphisms and correlating the presence or absence of certain polymporphisms with toxic effects of chemotherapies. More specifically, the present invention is directed to methods and compositions for determining the presence or absence of polymorphisms within a uridine diphosphate glucuronosyltransferase 1A1 (UGT1A1) promoter and correlating these polymorphisms with toxic effects of irinotecan, as well as evaluating the risk of an individual for developing irinotecan toxicity. In some embodiments, the invention concerns methods and compositions for predicting or anticipating the level of toxicity caused by irinotecan and other compounds glucuronidated by a UGT enzyme in a patient. Such methods and compositions can be used to evaluate whether irinotecan-based therapy or therapy involving a UGT substrate may pose toxicity problems if given to a particular patient. Alterations in suggested therapy may ensue if a toxicity risk is assessed.

[0005] 2. Description of Related Art

[0006] Glucuronidation plays a major role in the pharmacological activity and clearance of a large variety of compounds (Tukey and Strassburg, 2000). Genetic studies of UDP-glucuronosyltransferases (UGTs) aim to characterize an individual's predisposition to various diseases and increased risk of adverse outcome to drug treatment. The variation in the UDP-glucuronosyltransferase 1A1 (UGT1A1) gene is the most extensively studied. UGT1A1 basal expression is affected by the variable number of TA repeats in the TATA box, i.e., (TA)_(n), see U.S. Pat. No. 6,395,481, which is incorporated herein by reference. A variable number of repeats (5, 6, 7, and 8) have been found in the UGT1A1 TATA box. Gene transcriptional efficiency has been inversely correlated to the number of TA repeats (Beutler et al., 1998). Thus, a larger TA repeat number is associated with reduced transcriptional activity (Beutler et al., 1998) leading to various degrees of impaired glucuronidation of UGT1A1 substrates.

[0007] Homozygosity for (TA)₇ allele is associated with Gilbert's syndrome (a familial mild hyperbilirubinemia) (Bosma et al., 1995 and Monaghan et al., 1996) and predisposition to the toxic effects of cancer treatment with irinotecan (Ando et al., 2000 and Iyer et al., 2002). Gilbert's syndrome has also been associated with missense coding variants in the UGT1A1 gene, in particular in Asian populations where these variants are relatively common. Increased risk of breast cancer was reported in African-American women who carried the (TA)₇ and (TA)₈ alleles (Guillemette et al., 2000). In addition to the TATA box, Sugatani et al., (2001) identified a region in the UGT1A1 promoter approximately 3 kb upstream of the TATA box that regulates UGT1A1 inducibility by phenobarbital. It is also hypothesized that this phenobarbital-responsive enhancer module (PBREM) might be modulated by endogenous factors (Sugatani et al., 2002). UGT1A1 activity is probably the result of PBREM-dependent modulation of TATA box-dependent basal expression.

[0008] Polymorphisms in UGT1A1 are relevant to the treatment of cancer patients with irinotecan. Irinotecan is a topoisomerase I inhibitor that is approved worldwide for the treatment of metastatic colorectal cancer. Irinotecan has a well established role as single agent in 5-fluorouracil-refractory patients (Rougier et al., 1998; Cunningham et al., 1998), as well as in combination with 5-fluorouracil/leucovorin as a first-line therapy (Saltz et al., 2000; Rothenberg et al., 2001).

[0009] Irinotecan hydrolysis by carboxylesterase-2 is responsible for its activation to SN-38 (7-ethyl-10-hydroxycamptothecin), a topoisomerase I inhibitor of much higher potency than irinotecan. The main inactivating pathway of irinotecan is the biotransformation of active SN-38 into inactive SN-38 glucuronide (SN-38G). Interpatient differences in systemic formation of SN-38G have been shown to have clear clinical consequences in patients treated with irinotecan. Patients with higher glucuronidation of SN-38 are more likely to be protected from the dose limiting toxicity of diarrhea in the weekly schedule (Gupta et al., 1994). SN-38 is glucuronidated by UDP-glucuronosyltransferase 1A1 (UGT1A1) (Iyer et al., 1997).

[0010] Despite its efficacy in treating metastatic colon cancer and its broad spectrum of activity in other tumor types, irinotecan treatment is associated with significant toxicity. SN-38 is an active metabolite of irinotecan, and SN-38 glucuronidation represents a mechanism to protect patients from the toxic effects of irinotecan (Gupta et al., 1994). Reduced SN-38 glucuronidation is thought to underlie the severe toxicity associated with irinotecan treatment in some patients (Gupta et al., 1994). The main severe toxicities of irinotecan are delayed diarrhea and myelosuppression. In the early single agent trials, grade 3-4 diarrhea occurred in about one third of patients and was dose limiting (Negoro et al., 1991; Rothenberg et al., 1993). Its frequency varies from study to study and is also schedule dependent. The frequency of grade 3-4 diarrhea in the three-weekly regimen (19%) is significantly lower compared to the weekly schedule (36%, Fuchs et al., 2003). In addition to diarrhea, grade 3-4 neutropenia is also a common adverse event, with about 30-40% of the patients experiencing it in both weekly and three-weekly regimens (Fuchs et al., 2003; Vanhoefer et al., 2001). Fatal events during irinotecan treatment have been reported. A high mortality rate of 5.3% and 1.6% was reported in the weekly and three-weekly single agent irinotecan regimens, respectively (Fuchs et al., 2003).

[0011] Although retrospective analysis of UGT1A1 genetic variation in relation to severe toxicity after different irinotecan-based regimens has been conducted in Japanese patients (Ando et al., 2000), prospective evaluation in a large trial has not been performed.

[0012] Thus, the problem of identifying the effects of various promoter polymorphism combinations on the expression of UGT1A1 for the determination of UGT activity levels remains. Improved methods and compositions for the evaluation of risk for irinotecan toxicity in an individual or patient are still needed.

SUMMARY OF THE INVENTION

[0013] Metabolism of SN-38, an active metabolite of irinotecan, via glucuronidation represents a mechanism to protect patients from the toxic effects of irinotecan, thus a reduction in SN-38 glucuronidation contributes to the probability that toxicity associated with irinotecan may be experienced in patients. While some genetic basis for reduced SN-38 glucuronidation have been identified, other basis have yet to be identified. Therefore, there remains a need for improved methods and compositions for evaluating polymorphisms in one or both UGT1A1 genes of a patient and correlating a genotype with adverse effects of various therapies.

[0014] The present invention is based on the fact that genetic variation is correlated with UGT1A1 expression and has several important clinical implications. The improved methods and compositions of the present invention may be used in determining if a treatment has a propensity to adversely affect a patient or what treatment may be appropriate or inappropriate for a particular patient. UGT1A1 basal transcription is affected by a polymorphic (TA) repeat (see FIG. 1 legend in Innocenti et al., 2002), in addition to a phenobarbital-responsive enhancer module (PBREM) that contains variants affecting inducible gene expression, as described herein. A “polymorphism” or “genetic polymorphism,” as referred to herein, is the existence of two or more variant forms of a particular characteristic, e.g., a single nucleotide or a repeat a nucleotide or nucleotides. Generally, variations are due to the addition, deletion, or substitution of one or more nucleotides at a site or a variation in the number of tandem repeats of a DNA sequence. In various embodiments, other polymorphisms within or outside the UGT1 gene locus, see Genbank accession number AF297093 which is incorporated herein by reference, may be used as long as an association of a polymorphism with a particular phenotype and/or haplotype can be established. Exemplary methods for genotyping a UGT1A gene may be found at least in U.S. Pat. Nos. 6,479,236, 6,472,157 and 6,395,481, each of which is incorporated herein by reference.

[0015] In various embodiments of the invention, significant linkage disequilibrium between a (TA) polymorphism and variants in the PBREM, or other variants within or outside the UGT1 gene locus, indicates that patients possessing such other variants may be at risk of irinotecan toxicity. “Significant” as used in respect to linkage disequilibrium, as determined by one of skill in the art, is contemplated to be a statistical p or α value that may be 0.25 or 0.1 and may be 0.1, 0.05, 0.001, 0.00001 or less. “Linkage disequilibrium” (“LD” as used herein, though also referred to as “LED” in the art) refers to a situation where a particular combination of alleles (i.e., a variant form of a given gene) or polymorphisms at two loci appears more frequently than would be expected by chance. The relationship between PBREM-(TA)_(n) haplotypes and the glucuronidation rate of the UGT1A1 substrate SN-38 may be used to correlate the genotype (i.e., the genetic make up of an organism) to a phentoype (i.e., the physical traits displayed by an organism or cell). “Haplotype” is used herein to refer to a collective genotype of two or more closely linked loci. Each haplotype defines the sequence of alleles or polymorphisms along one of the homologous chromosomes. In some embodiments, the polymorphisms may be 0.001, 0.01, 0.1, 0.2 cM or more within one another.

[0016] Various embodiments of the invention include methods for evaluating the risk of toxicity from irinotecan, or other UGT1A1 substrates, in a patient. A polymorphism may be a single nucleotide polymorphism (SNP) and may be in linkage disequilibrium with a (TA)_(n) repeat. In certain embodiments, the methods include detecting one or more polymorphisms in one or both copies of the UGT1A1 gene and/or one or both copies of any other gene located in the UGT1 gene locus of a patient. In particular embodiments a promoter polymorphism is detected. It is specifically contemplated that methods and compositions of the invention may be implemented to determine whether UGT1A1 polymorphisms are present or absent in one or both alleles.

[0017] In certain embodiments, a polymorphism may be a polymorphism that affects the transcription of UGT1A1, such as in the promoter region or 5′ flanking region that affects transcription (which includes the promoter region), and in particular a polymorphism at nucleotide position −3440, −3401, −3279, −3177, −3175, or −3156 from the UGT1A1 gene transcriptional start site, which is designated +1 with no nucleotide designated as 0. The number of TA repeats can be 5, 6, 7, 8 or more TA repeats. In particular embodiments, the polymorphism is the following: −3440C>A, −3401T>C, −3279G>T, −3177C>G, −3175A>G, −3156G>A, or any combination thereof. The notation −3440C>A, for example indicates that cytosine nucleotide (C) at the −3440 position is replaced by an Adenosine (A).

[0018] Methods of the invention may include obtaining a nucleic acid sample from a patient and detecting one or more polymorphisms in the UGT1A1 gene using various methods. In certain embodiments, polymorphism detection may include amplifying a nucleic acid containing all or part of a particular region of the UGT1A1 gene to obtain amplification products; and/or analyzing the amplification products for the presence or absence of one or more polymorphisms. Other methods of polymorphism detection known in the art are also contemplated.

[0019] In certain embodiments, a promoter polymorphism of a UGT1A1 gene may be detected by performing one of a variety of known assays. These may include, but are not limited to hybridization assays, sequencing or microsequencing assays, allele-specific amplification assays or any other methods known for detecting nucleic acid polymorphisms, which may or may not include amplification of a nucleic acid. It is understood that “detecting” a polymorphism includes identifying the nucleotide sequence at that site and/or determining whether the polymorphism is present or absent.

[0020] A correlation between one or more polymorphisms and the glucoronidation rate of irinotecan or other substrates of UGT1A1, including but not limited to bilirubin, estriol, beta-estradiol, 2-hydroxyestriol, 2-hydroxyestrone, 2-hydroxyestradiol, thyroxine (T4), rT3, octyl gallate, propyl gallate, anthraflavic acid, quercitin, fisetin, naringenin, 1-naphtol, and ethynylestradiol, may be used to determine various aspects of a treatment regime, including irinotecan and/or other drugs or compounds metabolized directly or indirectly by UGT1A1. In some embodiments the methods also include analyzing the glucuronidation rate associated with the various polymorphisms and polymorphism combinations, for exemplary methods and compositions related to analysis of glucuronidation rates see U.S. Pat. No. 6,319,678, which is incorporated herein by reference. The methods may also include determining the biliary transport capacity of the patient. In particular embodiments the evaluation of the promoter polymorphism may be used to optimize the dose of irinotecan or other compounds for treatment of a patient or to reduce their toxicity.

[0021] The methods of the invention may further include treating a patient by administering to the patient irinotecan in combination with other pharmaceutical agents at appropriate dosages, such that the toxicity of irinotecan or other substrates of UGT1A1 are reduced. In particular embodiments, a second agent that reduces excretion of an active irinotecan species through the bile may administered in conjunction with irinotecan based upon determinations made using methods and compositions of the invention, for related methods and compositions see U.S. Pat. Nos. 6,407,117, 6,287,834 and 5,786,344, each of which is incorporated herein by reference.

[0022] The present invention is also based on the observation that the nucleotide at position −3516 in the UGT1A1 upstream region is correlated with irinotecan toxicity. An A at that position positively correlates with irinotecan toxicity while a G at that position correlates with tolerance to irinotecan. Thus, the present invention concerns methods and compositions for evaluating, predicting, and determining whether a patient will experience toxicity from irinotecan. Toxicity from irinotecan evidences itself as side effects from the drug, which are well known to oncologists and their patients.

[0023] In some embodiments of the invention, there are methods of predicting whether a patient may suffer or be subject to toxicity from irinotecan if given it involving determining the nucleic acid sequence of base −3516 in the UGT1A1 5′ flanking region in one or both alleles of the patient. The presence of an A nucleotide indicates the person is at risk for irinotecan toxicity. An AA genotype is more closely correlated with grade 4 neutropenia than other genotypes at that position. Moreover, in some embodiments, this is unrelated to the genotype of the TA indel in the UGT1A1 promoter. It is contemplated that these methods concerning the indel at position −3516 in the UGT1A1 5′ flanking region can be implemented with methods involving determining one or more other polymorphisms in the UGT1A1 5′ flanking region of the same patient.

[0024] Consequently, if a person is identified as at risk for irinotecan toxicity based on any of the embodiments discussed herein, an alternative course of therapy or a lower dose of irinotecan than is normally given may be contemplated. In addition, methods also include determining the sequence of other polymorphisms or indels (insertion/deletions) in linkage disequilibrium (LD) with the −3516 variant. Therefore, in some embodiments of the invention, the TA indel is evaluated to determine the number of repeats. Also, any other variant in UGT1A1 or any other gene (the term “gene” includes non-coding regions that affect the expression or activity level of the encoded polypeptide) may be evaluated for variants in LD with the −3516 variant.

[0025] Various embodiments may include a kit for evaluating the risk of irinotecan toxicity in a patient. The kit may include a variety of containers, reagents and the like. In certain embodiments, the kit may include an oligonucleotide primer to amplify a promoter region of a UGT1A1 gene or genes, haplotype tag SNPs or allele specific amplification primers of the UGT1A1 gene or any other primer within the UGT1 gene locus. The haplotype tag SNPs or allele specific primers may be used to amplify a polymorphism at one or more nucleotide positions of the UGT1A1 gene or other UGT1 locus gene. In particular embodiments, the nucleotide position may be at −3440, −3401, −3279, −3177, −3175, or −3156, or a combination thereof, from the UGT1A1 gene transcriptional start site. The kit may include the haplotype tag SNPs or allele specific amplification primers in multi-well assay plate. The kit may also include haplotype tag SNPs or allele specific hybridization probes for a variety of promoter polymorphisms. The haplotype tag SNPs or allele specific hybridization probes may detect polymorphisms at nucleotide position −3440, −3401, −3279, −3177, −3175, or −3156 from the UGT1A1 gene transcriptional start site. The kit may include haplotype tag SNPs or allele specific hybridization probes comprised in an oligonucleotide array or microarray.

[0026] Compositions of the invention include nucleic acids that can be used to determine the sequence at position −3516 of UGT1A or other reagents in that regard. Arrays and other assays for screening multiple samples are also included as part of the invention. Such compositions may be incorporated into kits or as part of a kit, along with any other composition discussed herein.

[0027] It is contemplated that any method or composition described herein can be implemented with respect to any other method or composition described herein. Similarly, any embodiment discussed with respect to one aspect of the invention may be used in the context of any other aspect of the invention.

[0028] Throughout this application, the term “about” is used to indicate that a value includes the standard deviation of error for the device or method being employed to determine the value.

[0029] The use of the word “a” or “an” when used in conjunction with the term “comprising” in the claims and/or the specification may mean “one,” but it is also consistent with the meaning of “one or more,” “at least one,” and “one or more than one.”

[0030] Other objects, features and advantages of the present invention will become apparent from the following detailed description. It should be understood, however, that the detailed description and the specific examples, while indicating specific embodiments of the invention, are given by way of illustration only, since various changes and modifications within the spirit and scope of the invention will become apparent to those skilled in the art from this detailed description.

BRIEF DESCRIPTION OF THE DRAWINGS

[0031] The following drawings form part of the present specification and are included to further demonstrate certain aspects of the present invention. The invention may be better understood by reference to one or more of these drawings in combination with the detailed description of specific embodiments presented herein.

[0032]FIG. 1 illustrates an exemplary phenobarbital-responsive enhancer module (PBREM) and description of polymorphisms. The previously described PBREM domains are underlined with the NR half-site sequences shown in bold. The polymorphic sites of the present application are included. The variants found at these sites are also listed. Positions indicated are from the first base of the UGT1A cluster sequence (Genbank accession No. AF297093).

[0033]FIG. 2A-2D illustrates (TA)_(n) genotype-phenotype relationship in human livers. (a) correlation in all samples investigated (n=83) (b) correlation in Caucasians (n=56) (c) correlation in African-Americans (n=15) (d) correlation in individuals of Asian (n=1), and unknown (n=10) ethnicity. Liver microsomes were phenotyped for SN-38 glucuronidation rates in each liver with a single experiment performed in triplicate. Bars show the mean value of SN-38 glucuronidation rates in each group.

[0034]FIG. 3 illustrates an exemplary haplotype-phenotype relationship in human livers of Caucasian and African origin (n=70). Bars show the mean value of SN-38 glucuronidation rates in each group. Only haplotypes with ≧2 samples are shown.

[0035]FIG. 4. Correlation between ANC and TA indel genotype. Bars represent the means. Nonparametric trend analysis (7/7<6/7<6/6, z=−2.72, p=0.01).

[0036]FIG. 5. Pre-treatment total bilirubin levels and distribution of the −3156 genotypes within each TA indel genotype. The −3156 AA genotypes are reported in squares, the GA genotypes in circles and the GG genotypes in triangles. Bars represent the mean values. A significant trend was reported (7/7>6/7>6/6, z=2.88, p<0.01, nonparametric trend analysis).

[0037]FIG. 6. Correlation between ln(ANC nadir) and pretreatment total bilirubin levels. Patients with bilirubin levels less than 0.6 mg/dl are depicted in squares. Those with bilirubin levels higher than 0.7 mg/dl are depicted in circles.

DESCRIPTION OF ILLUSTRATIVE EMBODIMENTS

[0038] The present invention provides improved methods and compositions for identifying the effects of various polymorphisms, promoter polymorphisms or any combination thereof on the expression of UGT1A1 or the glucuronidation rate of UGT1A1 for the evaluation of the potential or risk for irinotecan toxicity in an individual or patient. The development of these improved methods and compositions allows for the use of such an evaluation to optimize treatment of a patient and to lower the risk of toxicity. In certain aspects of the invention various combinations of promoter polymorphisms may be used in this evaluation, in particular, polymorphisms in the PBREM region and polymorphisms in the TA repeats may be used.

[0039] Genetic variation in UGT1A1 expression has several important clinical implications. UGT1A1 basal transcription is affected by a polymorphic (TA) repeat. Another important regulatory element is the phenobarbital-responsive enhancer module (PBREM) which may contain variants affecting inducible gene expression. The examples provided herein study the extent of linkage disequilibrium between the (TA) polymorphism and variants in the PBREM and UGT1A1 promoter. The relationship between PBREM-(TA)_(n) haplotypes and the glucuronidation rate of the UGT1A1 substrate SN-38 is also addressed herein. Studies described in the Examples illustrate that SN-38G formation rates were correlated with (TA) genotypes and promoter variants. In vaious aspects particular (TA) variants are in linkage disequilibrium with various other polymorphisms.

[0040] Certain aspects of the invention are based on, but not limited to, the observation and characterization of novel polymorphisms in the PBREM region of the UGT1A1 gene. Due to the clinical implications of genetically modified regulation of UGT1A1 activity, the PBREM region was sequenced and polymorphisms in the TATA box of the UGT1A1 promoter genotyped, as described in the Examples section below.

[0041] I. Hepatic Glucuronidation by UGT Enzymes

[0042] Hepatic glucuronidation results from the activities of a multigene family of UGT enzymes, the members of which exhibit specificity for a variety of endogenous substrates and xenobiotics. The UGT enzymes are broadly classified into two distinct gene families. The UGT1 locus codes for multiple isoforms of UGT, all of which share a C-terminus encoded by a unique set of exons 2-5, but which have a variable N-terminus encoded by different first exons, each with its own independent promoter (Bosma et al., 1992; Ritter et al., 1992). The variable first exons confer the substrate specificity of the enzyme. Isoforms of the UGT2 family are unique gene products of which at least eight isozymes have been identified (Clarke et al. Handbook of Experimental Pharmacology 1994). The UGT1A1 isoform is the major bilirubin glucuronidation enzyme. Genetic defects in the UGT1A1 gene can result in decreased glucuronidation activity which leads to abnormally high levels of unconjugated serum bilirubin that may enter the brain and cause encephalopathy and kernicterus (Owens & Ritter, 1995). This condition is commonly known as Gilbert's syndrome. The molecular defect in Gilbert's Syndrome is a change in the TATA box within the UGT1A1 promoter (Bosma et al., 1995 and Monaghan et al., 1996). This promoter usually contains a (TA)₆ TAA element, but another allele, termed UGT1A1*28 or allele 7, is also present in human populations at high frequencies, and contains the sequence (TA)₇ TAA. This polymorphism in the promoter of the UGT1A1 gene results in reduced expression of the gene and accounts for most cases of Gilbert's Syndrome (Bosma et al., 1995). Overall, gene expression levels for the UGT1A1 promoter alleles are inversely related to the length of the TA repeat in the TATA box.

[0043] The variation observed in this promoter may also account for the inter-individual and inter-ethnic variation in drug metabolism and response to xenobiotic exposure. UGTs have been shown to contribute to the detoxification and elimination of both exogenous and endogenous compounds. For example, one typical role of the UGT1A1 isoform is the glucuronidation of SN-38 (7-ethyl-10-hydroxycamptothecin) to the corresponding glucuronide (10-O-glucuronyl-SN-38, SN-38G) as well as the glucuronidation of TAS-103 (6-[[2-(dimethylamino)ethyl]amino]-3-hydroxy-7H-indeno[2,1-c]quinoline-7-one dihydrochloride) to its corresponding glucuronide (TAS-103G). SN-38 is the active form of irinotecan (CPT-11, 7-ethyl-10-[4-(1-piperidino)-1-piperidino]carbonyloxycamptothecin), which is a camptothecin derivative used in the treatment of metastatic colorectal cancer and other malignancies. The metabolism of SN-38 and TAS-103 (also known as flavopiridol) are merely illustrative of the present invention the metabolism of other UGT1A1 substrates is also contemplated, such as estradiol, bilirubin, simple phenols, flavones, C18 steroids, complex phenols and coumarins.

[0044] Irinotecan is biotransformed by tissue and serum carboxylesterases to an active metabolite, SN-38, which has a 100-1,000-fold higher antitumor activity than irinotecan. SN-38 is glucuronidated by hepatic uridine diphosphate glucuronosyltransferases (UGTs) to form SN-38 glucuronide (10O-glucuronyl-SN-38, SN-38G), which is inactive and excreted into the bile and urine although, SN-38G might be deconjugated to form SN-38 by intestinal β-glucuronidase enzyme (Kaneda et al., 1990).

[0045] The major dose-limiting toxicities of irinotecan include diarrhea and, to a lesser extent, myelosuppression. irinotecan-induced diarrhea can be serious and often does not respond adequately to conventional antidiarrheal agents (Takasuna et al., 1995). This diarrhea may be due to direct enteric injury caused by the active metabolite, SN-38, which has been shown to accumulate in the intestine after intra peritoneal administration of irinotecan in athymic mice (Araki et al., 1993). The results of a recently completed phase I clinical trial, demonstrated that there was an inverse relationship between SN-38 glucuronidation rates and severity of diarrheal incidences in patients treated with increasing doses of Irinotecan (Gupta et al., 1994). These findings indicate that glucuronidation of SN-38 protects against Irinotecan-induced gastrointestinal toxicity. A complete discussion of the correlation between diarrhea and SN-38 glucuronidation, as well as a description of biochemical methods for determining glucurondation levels can be found in U.S. Pat. No. 5,786,344 and WO96/01127 which are both incorporated herein by reference in their entirety. Likewise, the results of studies using TAS-103 demonstrate that glucuronidation of TAS-103 may protect against TAS-103 induced toxicity. Therefore, the conversion of these two toxic compounds by hepatic UGTs demonstrates the importance of monitoring glucuronidation activity as an indicator of susceptibility to toxicity caused by exposure to compounds that are metabolized by UGTs. Furthermore, differential rates of SN-38 glucuronidation among subjects may explain the considerable inter-individual variation in the pharmacokinetic parameter estimates and toxicities observed after treatment with anti-cancer drugs or exposure to xenobiotics (Gupta et al., 1994; Gupta et al., 1997).

[0046] When two species, Gunn rats (Gunn, 1938) and CN-1 patients, that are deficient in UGT1A isoforms were screened for TAS-103 and SN-38 glucuronidation activity, there was approximately an 80% lower glucuronidation rate of TAS-103 in vitro and no in vitro glucuronidation of SN-38 compared to healthy liver donors. These results demonstrate the role of the UGT 1 family in catalyzing SN-38 and TAS-103 conjugation. Furthermore, these results demonstrate that the UGT2 family does not play a role in the glucuronidation of SN-38. On the other hand, while isoforms of the UGT1 family are the predominate isoforms involved in TAS-103 glucuronidation, the isoform of the UGT2 family may also participate in TAS-103 glucuronidation. Failure to glucuronidate SN-38 and TAS-103 in these instances may result specifically from the genetic defect in UGT1 gene family.

[0047] Other experiments confirm the association between the UGT1A1 isoform and SN-38 and TAS-103 glucuronidation. These studies show that substantial genetic variability exists in the UGT1A1 isoform family and particularly in the UGT1A1 promoter. This genetic variability has been shown to correlate with gene expression. For example, the presence of the 5 allele in the UGT1A1 promoter leads to increased gene expression while the presence of the 8 allele leads to reduced gene expression. Differences in gene expression levels may give rise to individuals with varying abilities to glucuronidate compounds metabolized by UGTs. This prediction was confirmed through a correlation analysis of UGT1A1 promoter genotype and rate of in vitro SN-38 and TAS-103 glucuronidation.

[0048] It follows therefore that individuals with the 8 allele may also have differing susceptibility to xenobiotics when compared to other genotypes when those compounds are metabolized by UGT1A1s. On the other hand, the presence of the 5 allele that correlates with increased gene expression and higher glucuronidation activity may result in the administration of less than optimum drug dosages. For example, when a drug metabolized by UGT1A1s is administered to an individual with this polymorphism, the increased glucuronidation activity may cause more of the drug to be converted into the inactive metabolite in a shorter period, thereby reducing the drug's effectiveness. Conversely, in the rare case of drugs and xenobiotics that require glucuronidation for activation, decreased glucuronidation activity may cause less of the activated form of the drug or xenobiotic to be available.

[0049] The fact that repeated sequences are intrinsically unstable and tend to lengthen and shorten as a result of unequal crossing-over during meiosis may explain the presence of other alleles, in addition to (TA)₆ and (TA)₇, in the population. Two additional alleles have been identified in human populations: allele 5, containing the sequence (TA)₅ TAA and allele 8, containing the sequence (TA)₈ TAA, see U.S. Pat. No. 6,395,481, which is incorporated in its entirety by reference. Interestingly, alleles 5 and 8 were found predominantly in population samples from Sub-Saharan Africa, where they occur at lower frequencies than the common alleles 6 and 7 although it is possible that these two alleles are present across a variety of ethnic groups. The frequency of alleles 6 and 7 also appears to differ significantly across ethnic groups, with Asian and Amerindian populations showing the highest frequencies of allele 6. Conversely, alleles 6 and 7 occur at intermediate and similar frequencies among Caucasians and Sub-Saharan Africans.

[0050] Several hypotheses may be proposed regarding the selective pressures that might be responsible for the observed pattern of inter-population variation at the UGT1A1 promoter. It was previously proposed that the maintenance of intermediate levels of bilirubin is adaptive (Beutler et al., 1998), and that the alleles at this promoter would be maintained in the population by balancing selection. This hypothesis is based on the observation that bilirubin is a potent antioxidant likely to have physiological significance (Stocker et al., 1987). However, it is also known that glucuronidation is an important detoxification step for many endogenous as well as exogenous compounds (Clarke & Burchell, 1994). In addition to TAS-103 and SN-38, UGT1A1 is likely to act on other substrates present in the environment, e.g., dietary components, environmental pollutants and carcinogens, which require detoxification as well as playing a role in the metabolism of bilirubin and other endogenous compounds. Within this framework, maintaining high levels of UGT1A1 gene expression would ensure rapid elimination of toxic or endogenous compounds and be advantageous.

[0051] As described herein, the correlation between in vitro glucuronidation rate and UGT1A1 promoter polymorphism found for alleles 6 and 7 has been shown to extend to alleles 5 and 8. Because these alleles appear to be more frequent in subsets of human populations (for example, those of African origin), an even higher inter-individual variability in SN-38 and TAS-103 metabolism might be expected within these populations. Because the inverse relationship between TA repeat size and rate of SN-38 glucuronidation extends to alleles 5 and 8, a screening assay that identifies these alleles can facilitate individualization of drug therapy, identify individuals susceptible to xenobiotic exposure, and can be used to improve drug dosage calculations.

[0052] II. Nucleic Acids

[0053] Certain embodiments of the present invention concern various nucleic acids, including promoters, amplification primers, oligonucleotide probes and other nucleic acid elements involved in the analysis of genomic DNA. In certain aspects, a nucleic acid comprises a wild-type, a mutant, or a polymorphic nucleic acid.

[0054] The term “nucleic acid” is well known in the art. A “nucleic acid” as used herein will generally refer to a molecule (i.e., a strand) of DNA, RNA or a derivative or analog thereof, comprising a nucleobase. A nucleobase includes, for example, a naturally occurring purine or pyrimidine base found in DNA (e.g., an adenine “A,” a guanine “G,” a thymine “T” or a cytosine “C”) or RNA (e.g., an A, a G, an uracil “U” or a C). The term “nucleic acid” encompass the terms “oligonucleotide” and “polynucleotide,” each as a subgenus of the term “nucleic acid.” The term “oligonucleotide” refers to a molecule of between about 3 and about 100 nucleobases in length. The term “polynucleotide” refers to at least one molecule of greater than about 100 nucleobases in length. A “gene” refers to coding sequence of a gene product, as well as introns and the promoter of the gene product. In addition to the UGT1A1 gene, other regulatory regions such as enhancers for UGT1A1 are contemplated as nucleic acids for use with compositions and methods of the claimed invention.

[0055] These definitions generally refer to a single-stranded molecule, but in specific embodiments will also encompass an additional strand that is partially, substantially or fully complementary to the single-stranded molecule. Thus, a nucleic acid may encompass a double-stranded molecule or a triple-stranded molecule that comprises one or more complementary strand(s) or “complement(s)” of a particular sequence comprising a molecule. As used herein, a single stranded nucleic acid may be denoted by the prefix “ss”, a double stranded nucleic acid by the prefix “ds”, and a triple stranded nucleic acid by the prefix “ts.”

[0056] In particular aspects, a nucleic acid encodes a protein, polypeptide, or peptide. In certain embodiments, the present invention concerns novel compositions comprising at least one proteinaceous molecule. As used herein, a “proteinaceous molecule,” “proteinaceous composition,” “proteinaceous compound,” “proteinaceous chain,” or “proteinaceous material” generally refers, but is not limited to, a protein of greater than about 200 amino acids or the full length endogenous sequence translated from a gene; a polypeptide of greater than about 100 amino acids; and/or a peptide of from about 3 to about 100 amino acids. All the “proteinaceous” terms described above may be used interchangeably herein.

[0057] 1. Preparation of Nucleic Acids

[0058] A nucleic acid may be made by any technique known to one of ordinary skill in the art, such as for example, chemical synthesis, enzymatic production or biological production. Non-limiting examples of a synthetic nucleic acid (e.g. a synthetic oligonucleotide), include a nucleic acid made by in vitro chemical synthesis using phosphotriester, phosphite or phosphoramidite chemistry and solid phase techniques such as described in European Patent 266,032, incorporated herein by reference, or via deoxynucleoside H-phosphonate intermediates as described by Froehler et al., 1986 and U.S. Pat. No. 5,705,629, each incorporated herein by reference. In the methods of the present invention, one or more oligonucleotide may be used. Various different mechanisms of oligonucleotide synthesis have been disclosed in for example, U.S. Pat. Nos. 4,659,774, 4,816,571, 5,141,813, 5,264,566, 4,959,463, 5,428,148, 5,554,744, 5,574,146, 5,602,244, each of which is incorporated herein by reference.

[0059] A non-limiting example of an enzymatically produced nucleic acid include one produced by enzymes in amplification reactions such as PCR™ (see for example, U.S. Pat. No. 4,683,202 and U.S. Pat. No. 4,682,195, each incorporated herein by reference), or the synthesis of an oligonucleotide described in U.S. Pat. No. 5,645,897, incorporated herein by reference. A non-limiting example of a biologically produced nucleic acid includes a recombinant nucleic acid produced (i.e., replicated) in a living cell, such as a recombinant DNA vector replicated in bacteria (see for example, Sambrook et al. 2001, incorporated herein by reference).

[0060] 2. Purification of Nucleic Acids

[0061] A nucleic acid may be purified on polyacrylamide gels, cesium chloride centrifugation gradients, chromatography columns or by any other means known to one of ordinary skill in the art (see for example, Sambrook et al., 2001, incorporated herein by reference). In some aspects, a nucleic acid is a pharmacologically acceptable nucleic acid. Pharmacologically acceptable compositions are known to those of skill in the art, and are described herein.

[0062] In certain aspects, the present invention concerns a nucleic acid that is an isolated nucleic acid. As used herein, the term “isolated nucleic acid” refers to a nucleic acid molecule (e.g., an RNA or DNA molecule) that has been isolated free of, or is otherwise free of, the bulk of the total genomic and transcribed nucleic acids of one or more cells. In certain embodiments, “isolated nucleic acid” refers to a nucleic acid that has been isolated free of, or is otherwise free of, bulk of cellular components or in vitro reaction components such as for example, macromolecules such as lipids or proteins, small biological molecules, and the like.

[0063] 3. Nucleic Acid Segments

[0064] In certain embodiments, the nucleic acid is a nucleic acid segment. As used herein, the term “nucleic acid segment,” are fragments of a nucleic acid, such as, for a non-limiting example, those that encode only part of a UGT1 gene locus or a UGT1A1 gene sequence. Thus, a “nucleic acid segment” may comprise any part of a gene sequence, including from about 2 nucleotides to the full length gene including promoter regions to the polyadenylation signal and any length that includes all the coding region.

[0065] Various nucleic acid segments may be designed based on a particular nucleic acid sequence, and may be of any length. By assigning numeric values to a sequence, for example, the first residue is 1, the second residue is 2, etc., an algorithm defining all nucleic acid segments can be created:

[0066] n to n+y

[0067] where n is an integer from 1 to the last number of the sequence and y is the length of the nucleic acid segment minus one, where n+y does not exceed the last number of the sequence. Thus, for a 10-mer, the nucleic acid segments correspond to bases 1 to 10, 2 to 11, 3 to 12 . . . and so on. For a 15-mer, the nucleic acid segments correspond to bases 1 to 15, 2 to 16, 3 to 17 . . . and so on. For a 20-mer, the nucleic segments correspond to bases 1 to 20, 2 to 21, 3 to 22 . . . and so on. In certain embodiments, the nucleic acid segment may be a probe or primer. As used herein, a “probe” generally refers to a nucleic acid used in a detection method or composition. As used herein, a “primer” generally refers to a nucleic acid used in an extension or amplification method or composition.

[0068] 4. Nucleic Acid Complements

[0069] The present invention also encompasses a nucleic acid that is complementary to a nucleic acid. A nucleic acid is “complement(s)” or is “complementary” to another nucleic acid when it is capable of base-pairing with another nucleic acid according to the standard Watson-Crick, Hoogsteen or reverse Hoogsteen binding complementarity rules. As used herein “another nucleic acid” may refer to a separate molecule or a spatial separated sequence of the same molecule. In preferred embodiments, a complement is a hybridization probe or amplification primer for the detection of a nucleic acid polymorphism.

[0070] As used herein, the term “complementary” or “complement” also refers to a nucleic acid comprising a sequence of consecutive nucleobases or semiconsecutive nucleobases (e.g., one or more nucleobase moieties are not present in the molecule) capable of hybridizing to another nucleic acid strand or duplex even if less than all the nucleobases do not base pair with a counterpart nucleobase. However, in some diagnostic or detection embodiments, completely complementary nucleic acids are preferred.

[0071] III. Nucleic Acid Detection

[0072] Some embodiments of the invention concern identifying polymorphisms in UGT1A1, correlating genotype or haplotype to phenotype, wherein the phenotype is lowered or altered UGT1A1 activity or expression, and then identifying such polymorphisms in patients who have or will be given irinotecan or related drugs or compounds. Thus, the present invention involves assays for identifying polymorphisms and other nucleic acid detection methods. Nucleic acids, therefore, have utility as probes or primers for embodiments involving nucleic acid hybridization. They may be used in diagnostic or screening methods of the present invention. Detection of nucleic acids encoding UGT1A1, as well as nucleic acids involved in the expression or stability of UGT1A1 polypeptides or transcripts, are encompassed by the invention. General methods of nucleic acid detection methods are provided below, followed by specific examples employed for the identification of polymorphisms, including single nucleotide polymorphisms (SNPs).

[0073] A. Hybridization

[0074] The use of a probe or primer of between 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, or 15 and 50, 60, 70, 80, 90, or 100 nucleotides, preferably between 17 and 100 nucleotides in length, or in some aspects of the invention up to 1-2 kilobases or more in length, allows the formation of a duplex molecule that is both stable and selective. Molecules having complementary sequences over contiguous stretches greater than 20 bases in length are generally preferred, to increase stability and/or selectivity of the hybrid molecules obtained. One will generally prefer to design nucleic acid molecules for hybridization having one or more complementary sequences of 20 to 30 nucleotides, or even longer where desired. Such fragments may be readily prepared, for example, by directly synthesizing the fragment by chemical means or by introducing selected sequences into recombinant vectors for recombinant production.

[0075] Accordingly, the nucleotide sequences of the invention may be used for their ability to selectively form duplex molecules with complementary stretches of DNAs and/or RNAs or to provide primers for amplification of DNA or RNA from samples. Depending on the application envisioned, one would desire to employ varying conditions of hybridization to achieve varying degrees of selectivity of the probe or primers for the target sequence.

[0076] For applications requiring high selectivity, one will typically desire to employ relatively high stringency conditions to form the hybrids. For example, relatively low salt and/or high temperature conditions, such as provided by about 0.02 M to about 0.10 M NaCl at temperatures of about 50° C. to about 70° C. Such high stringency conditions tolerate little, if any, mismatch between the probe or primers and the template or target strand and would be particularly suitable for isolating specific genes or for detecting a specific polymorphism. It is generally appreciated that conditions can be rendered more stringent by the addition of increasing amounts of formamide. For example, under highly stringent conditions, hybridization to filter-bound DNA may be carried out in 0.5 M NaHPO₄, 7% sodium dodecyl sulfate (SDS), 1 mM EDTA at 65° C., and washing in 0.1×SSC/0.1% SDS at 68° C. (Ausubel et al., 1989).

[0077] Conditions may be rendered less stringent by increasing salt concentration and/or decreasing temperature. For example, a medium stringency condition could be provided by about 0.1 to 0.25M NaCl at temperatures of about 37° C. to about 55° C., while a low stringency condition could be provided by about 0.15M to about 0.9M salt, at temperatures ranging from about 20° C. to about 55° C. Under low stringent conditions, such as moderately stringent conditions the washing may be carried out for example in 0.2×SSC/0.1% SDS at 42° C. (Ausubel et al., 1989). Hybridization conditions can be readily manipulated depending on the desired results.

[0078] In other embodiments, hybridization may be achieved under conditions of, for example, 50 mM Tris-HCl (pH 8.3), 75 mM KCl, 3 mM MgCl₂, 1.0 mM dithiothreitol, at temperatures between approximately 20° C. to about 37° C. Other hybridization conditions utilized could include approximately 10 mM Tris-HCl (pH 8.3), 50 mM KCl, 1.5 mM MgCl₂, at temperatures ranging from approximately 40° C. to about 72° C.

[0079] In certain embodiments, it will be advantageous to employ nucleic acids of defined sequences of the present invention in combination with an appropriate means, such as a label, for determining hybridization. A wide variety of appropriate indicator means are known in the art, including fluorescent, radioactive, enzymatic or other ligands, such as avidin/biotin, which are capable of being detected. In preferred embodiments, one may desire to employ a fluorescent label or an enzyme tag such as urease, alkaline phosphatase or peroxidase, instead of radioactive or other environmentally undesirable reagents. In the case of enzyme tags, colorimetric indicator substrates are known that can be employed to provide a detection means that is visibly or spectrophotometrically detectable, to identify specific hybridization with complementary nucleic acid containing samples. In other aspects, a particular nuclease cleavage site may be present and detection of a particular nucleotide sequence can be determined by the presence or absence of nucleic acid cleavage.

[0080] In general, it is envisioned that the probes or primers described herein will be useful as reagents in solution hybridization, as in PCR, for detection of expression or genotype of corresponding genes, as well as in embodiments employing a solid phase. In embodiments involving a solid phase, the test DNA (or RNA) is adsorbed or otherwise affixed to a selected matrix or surface. This fixed, single-stranded nucleic acid is then subjected to hybridization with selected probes under desired conditions. The conditions selected will depend on the particular circumstances (depending, for example, on the G+C content, type of target nucleic acid, source of nucleic acid, size of hybridization probe, etc.). Optimization of hybridization conditions for the particular application of interest is well known to those of skill in the art. After washing of the hybridized molecules to remove non-specifically bound probe molecules, hybridization is detected, and/or quantified, by determining the amount of bound label. Representative solid phase hybridization methods are disclosed in U.S. Pat. Nos. 5,843,663, 5,900,481 and 5,919,626. Other methods of hybridization that may be used in the practice of the present invention are disclosed in U.S. Pat. Nos. 5,849,481, 5,849,486 and 5,851,772. The relevant portions of these and other references identified in this section of the Specification are incorporated herein by reference.

[0081] B. Amplification of Nucleic Acids

[0082] Nucleic acids used as a template for amplification may be isolated from cells, tissues or other samples according to standard methodologies (Sambrook et al., 2001). In certain embodiments, analysis is performed on whole cell or tissue homogenates or biological fluid samples with or without substantial purification of the template nucleic acid. The nucleic acid may be genomic DNA or fractionated or whole cell RNA. Where RNA is used, it may be desired to first convert the RNA to a complementary DNA.

[0083] The term “primer,” as used herein, is meant to encompass any nucleic acid that is capable of priming the synthesis of a nascent nucleic acid in a template-dependent process. Typically, primers are oligonucleotides from ten to twenty and/or thirty base pairs in length, but longer sequences can be employed. Primers may be provided in double-stranded and/or single-stranded form, although the single-stranded form is preferred.

[0084] Pairs of primers designed to selectively hybridize to nucleic acids corresponding to the UGT1 gene locus (Genbank accession AF279093), UGT1A1 gene and/or SEQ ID NO:1 or variants thereof, and fragments thereof are contacted with the template nucleic acid under conditions that permit selective hybridization. SEQ ID NO:1 set forth a nucleotide sequence that includes a majority of the UGT1A1 gene. SEQ ID NO:1 includes nucleotides 169,831 to 187,313 of the UGT1 gene locus with nucleotide 1645 of SEQ ID NO:1 corresponding to nucleotide −3565 from the transcriptional start of the UGT1A1 gene, thus the transcriptional start is located at nucleotide 5212 of SEQ ID NO:1. Depending upon the desired application, high stringency hybridization conditions may be selected that will only allow hybridization to sequences that are completely complementary to the primers. In other embodiments, hybridization may occur under reduced stringency to allow for amplification of nucleic acids that contain one or more mismatches with the primer sequences. Once hybridized, the template-primer complex is contacted with one or more enzymes that facilitate template-dependent nucleic acid synthesis. Multiple rounds of amplification, also referred to as “cycles,” are conducted until a sufficient amount of amplification product is produced.

[0085] The amplification product may be detected, analyzed or quantified. In certain applications, the detection may be performed by visual means. In certain applications, the detection may involve indirect identification of the product via chemiluminescence, radioactive scintigraphy of incorporated radiolabel or fluorescent label or even via a system using electrical and/or thermal impulse signals (Affymax technology; Bellus, 1994).

[0086] A number of template dependent processes are available to amplify the oligonucleotide sequences present in a given template sample. One of the best known amplification methods is the polymerase chain reaction (referred to as PCR™) which is described in detail in U.S. Pat. Nos. 4,683,195, 4,683,202 and 4,800,159, and in Innis et al., 1988, each of which is incorporated herein by reference in their entirety.

[0087] Another method for amplification is ligase chain reaction (“LCR”), disclosed in European Application No. 320 308, incorporated herein by reference in its entirety. U.S. Pat. No. 4,883,750 describes a method similar to LCR for binding probe pairs to a target sequence. A method based on PCR™ and oligonucleotide ligase assay (OLA) (described in further detail below), disclosed in U.S. Pat. No. 5,912,148, may also be used.

[0088] Alternative methods for amplification of target nucleic acid sequences that may be used in the practice of the present invention are disclosed in U.S. Pat. Nos. 5,843,650, 5,846,709, 5,846,783, 5,849,546, 5,849,497, 5,849,547, 5,858,652, 5,866,366, 5,916,776, 5,922,574, 5,928,905, 5,928,906, 5,932,451, 5,935,825, 5,939,291 and 5,942,391, Great Britain Application 2 202 328, and in PCT Application PCT/US89/01025, each of which is incorporated herein by reference in its entirety. Qbeta Replicase, described in PCT Application PCT/US87/00880, may also be used as an amplification method in the present invention.

[0089] An isothermal amplification method, in which restriction endonucleases and ligases are used to achieve the amplification of target molecules that contain nucleotide 5′-[alpha-thio]-triphosphates in one strand of a restriction site may also be useful in the amplification of nucleic acids in the present invention (Walker et al., 1992). Strand Displacement Amplification (SDA), disclosed in U.S. Pat. No. 5,916,779, is another method of carrying out isothermal amplification of nucleic acids which involves multiple rounds of strand displacement and synthesis, i.e., nick translation

[0090] Other nucleic acid amplification procedures include transcription-based amplification systems (TAS), including nucleic acid sequence based amplification (NASBA) and 3SR (Kwoh et al., 1989; PCT Application WO 88/10315, incorporated herein by reference in their entirety). European Application 329 822 disclose a nucleic acid amplification process involving cyclically synthesizing single-stranded RNA (“ssRNA”), ssDNA, and double-stranded DNA (dsDNA), which may be used in accordance with the present invention.

[0091] PCT Application WO 89/06700 (incorporated herein by reference in its entirety) disclose a nucleic acid sequence amplification scheme based on the hybridization of a promoter region/primer sequence to a target single-stranded DNA (“ssDNA”) followed by transcription of many RNA copies of the sequence. This scheme is not cyclic, i.e., new templates are not produced from the resultant RNA transcripts. Other amplification methods include “RACE” and “one-sided PCR” (Frohman, 1990; Ohara et al., 1989).

[0092] C. Detection of Nucleic Acids

[0093] Following any amplification, it may be desirable to separate the amplification product from the template and/or the excess primer. In one embodiment, amplification products are separated by agarose, agarose-acrylamide or polyacrylamide gel electrophoresis using standard methods (Sambrook et al., 2001). Separated amplification products may be cut out and eluted from the gel for further manipulation. Using low melting point agarose gels, the separated band may be removed by heating the gel, followed by extraction of the nucleic acid.

[0094] Separation of nucleic acids may also be effected by spin columns and/or chromatographic techniques known in art. There are many kinds of chromatography which may be used in the practice of the present invention, including adsorption, partition, ion-exchange, hydroxylapatite, molecular sieve, reverse-phase, column, paper, thin-layer, and gas chromatography as well as HPLC.

[0095] In certain embodiments, the amplification products are visualized, with or without separation. A typical visualization method involves staining of a gel with ethidium bromide and visualization of bands under UV light. Alternatively, if the amplification products are integrally labeled with radio- or fluorometrically-labeled nucleotides, the separated amplification products can be exposed to x-ray film or visualized under the appropriate excitatory spectra.

[0096] In one embodiment, following separation of amplification products, a labeled nucleic acid probe is brought into contact with the amplified marker sequence. The probe preferably is conjugated to a chromophore but may be radiolabeled. In another embodiment, the probe is conjugated to a binding partner, such as an antibody or biotin, or another binding partner carrying a detectable moiety.

[0097] In particular embodiments, detection is by Southern blotting and hybridization with a labeled probe. The techniques involved in Southern blotting are well known to those of skill in the art (see Sambrook et al., 2001). One example of the foregoing is described in U.S. Pat. No. 5,279,721, incorporated by reference herein, which discloses an apparatus and method for the automated electrophoresis and transfer of nucleic acids. The apparatus permits electrophoresis and blotting without external manipulation of the gel and is ideally suited to carrying out methods according to the present invention.

[0098] Other methods of nucleic acid detection that may be used in the practice of the instant invention are disclosed in U.S. Pat. Nos. 5,840,873, 5,843,640, 5,843,651, 5,846,708, 5,846,717, 5,846,726, 5,846,729, 5,849,487, 5,853,990, 5,853,992, 5,853,993, 5,856,092, 5,861,244, 5,863,732, 5,863,753, 5,866,331, 5,905,024, 5,910,407, 5,912,124, 5,912,145, 5,919,630, 5,925,517, 5,928,862, 5,928,869, 5,929,227, 5,932,413 and 5,935,791, each of which is incorporated herein by reference.

[0099] D. Other Assays

[0100] Other methods for genetic screening may be used within the scope of the present invention, for example, to detect mutations in genomic DNA, cDNA and/or RNA samples. Methods used to detect point mutations include denaturing gradient gel electrophoresis (“DGGE”), restriction fragment length polymorphism analysis (“RFLP”), chemical or enzymatic cleavage methods, direct sequencing of target regions amplified by PCR™ (see above), single-strand conformation polymorphism analysis (“SSCP”) and other methods well known in the art.

[0101] One method of screening for point mutations is based on RNase cleavage of base pair mismatches in RNA/DNA or RNA/RNA heteroduplexes. As used herein, the term “mismatch” is defined as a region of one or more unpaired or mispaired nucleotides in a double-stranded RNA/RNA, RNA/DNA or DNA/DNA molecule. This definition thus includes mismatches due to insertion/deletion mutations, as well as single or multiple base point mutations.

[0102] U.S. Pat. No. 4,946,773 describes an RNase A mismatch cleavage assay that involves annealing single-stranded DNA or RNA test samples to an RNA probe, and subsequent treatment of the nucleic acid duplexes with RNase A. For the detection of mismatches, the single-stranded products of the RNase A treatment, electrophoretically separated according to size, are compared to similarly treated control duplexes. Samples containing smaller fragments (cleavage products) not seen in the control duplex are scored as positive.

[0103] Other investigators have described the use of RNase I in mismatch assays. The use of RNase I for mismatch detection is described in literature from Promega Biotech. Promega markets a kit containing RNase I that is reported to cleave three out of four known mismatches. Others have described using the MutS protein or other DNA-repair enzymes for detection of single-base mismatches.

[0104] Alternative methods for detection of deletion, insertion or substitution mutations that may be used in the practice of the present invention are disclosed in U.S. Pat. Nos. 5,849,483, 5,851,770, 5,866,337, 5,925,525 and 5,928,870, each of which is incorporated herein by reference in its entirety.

[0105] E. Specific Examples of SNP Screening Methods

[0106] Spontaneous mutations that arise during the course of evolution in the genomes of organisms are often not immediately transmitted throughout all of the members of the species, thereby creating polymorphic alleles that co-exist in the species populations. Often polymorphisms are the cause of genetic diseases. Several classes of polymorphisms have been identified. For example, variable nucleotide type polymorphisms (VNTRs), arise from spontaneous tandem duplications of di- or trinucleotide repeated motifs of nucleotides. If such variations alter the lengths of DNA fragments generated by restriction endonuclease cleavage, the variations are referred to as restriction fragment length polymorphisms (RFLPs). RFLPs are been widely used in human and animal genetic analyses.

[0107] Another class of polymorphisms are generated by the replacement of a single nucleotide. Such single nucleotide polymorphisms (SNPs) rarely result in changes in a restriction endonuclease site. Thus, SNPs are rarely detectable restriction fragment length analysis. SNPs are the most common genetic variations and occur once every 100 to 300 bases and several SNP mutations have been found that affect a single nucleotide in a protein-encoding gene in a manner sufficient to actually cause a genetic disease. SNP diseases are exemplified by hemophilia, sickle-cell anemia, hereditary hemochromatosis, late-onset alzheimer disease etc.

[0108] In context of the present invention, polymorphic mutations that affect the activity and/or levels of the UGT1A1 gene products, which are responsible for the glucuronidation of irinotecan and other chemotherapeutic and xenobiotic agents, will be determined by a series of screening methods. One set of screening methods is aimed at identifying SNPs that affect the inducibility, activity and/or level of the UGT1A1 gene products in in vitro or in vivo assays. The other set of screening methods will then be performed to screen an individual for the occurrence of the SNPs identified above. To do this, a sample (such as blood or other bodily fluid or tissue sample) will be taken from a patient for genotype analysis. The presence or absence of SNPs will determine the ability of the screened individuals to metabolize irinotecan and other chemotherapeutic agents that are metabolized by the UGT1A1 gene products. According to methods provided by the invention, these results will be used to adjust and/or alter the dose of irinotecan or other agent administered to an individual in order to reduce drug side effects.

[0109] SNPs can be the result of deletions, point mutations and insertions and in general any single base alteration, whatever the cause, can result in a SNP. The greater frequency of SNPs means that they can be more readily identified than the other classes of polymorphisms. The greater uniformity of their distribution permits the identification of SNPs “nearer” to a particular trait of interest. The combined effect of these two attributes makes SNPs extremely valuable. For example, if a particular trait (e.g., inability to efficiently metabolize irinotecan) reflects a mutation at a particular locus, then any polymorphism that is linked to the particular locus can be used to predict the probability that an individual will be exhibit that trait.

[0110] Several methods have been developed to screen polymorphisms and some examples are listed below. The reference of Kwok and Chen (2003) and Kwok (2001) provide overviews of some of these methods; both of these references are specifically incorporated by reference.

[0111] SNPs relating to glucuronidation of chemotherapeutic agents can be characterized by the use of any of these methods or suitable modification thereof. Such methods include the direct or indirect sequencing of the site, the use of restriction enzymes where the respective alleles of the site create or destroy a restriction site, the use of allele-specific hybridization probes, the use of antibodies that are specific for the proteins encoded by the different alleles of the polymorphism, or any other biochemical interpretation.

[0112] i) DNA Sequencing

[0113] The most commonly used method of characterizing a polymorphism is direct DNA sequencing of the genetic locus that flanks and includes the polymorphism. Such analysis can be accomplished using either the “dideoxy-mediated chain termination method,” also known as the “Sanger Method” (Sanger, F., et al., 1975) or the “chemical degradation method,” also known as the “Maxam-Gilbert method” (Maxam, A. M., et al., 1977). Sequencing in combination with genomic sequence-specific amplification technologies, such as the polymerase chain reaction may be utilized to facilitate the recovery of the desired genes (Mullis, K. et al., 1986; European Patent Application 50,424; European Patent Application. 84,796, European Patent Application 258,017, European Patent Application. 237,362; European Patent Application. 201,184; U.S. Pat. Nos. 4,683,202; 4,582,788; and 4,683,194), all of the above incorporated herein by reference.

[0114] ii) Exonuclease Resistance

[0115] Other methods that can be employed to determine the identity of a nucleotide present at a polymorphic site utilize a specialized exonuclease-resistant nucleotide derivative (U.S. Pat. No. 4,656,127). A primer complementary to an allelic sequence immediately 3′- to the polymorphic site is hybridized to the DNA under investigation. If the polymorphic site on the DNA contains a nucleotide that is complementary to the particular exonucleotide-resistant nucleotide derivative present, then that derivative will be incorporated by a polymerase onto the end of the hybridized primer. Such incorporation makes the primer resistant to exonuclease cleavage and thereby permits its detection. As the identity of the exonucleotide-resistant derivative is known one can determine the specific nucleotide present in the polymorphic site of the DNA.

[0116] iii) Microsequencing Methods

[0117] Several other primer-guided nucleotide incorporation procedures for assaying polymorphic sites in DNA have been described (Komher, J. S. et al., 1989; Sokolov, B. P., 1990; Syvanen 1990; Kuppuswamy et al., 1991; Prezant et al., 1992; Ugozzoll, L. et al., 1992; Nyren et al., 1993). These methods rely on the incorporation of labeled deoxynucleotides to discriminate between bases at a polymorphic site. As the signal is proportional to the number of deoxynucleotides incorporated, polymorphisms that occur in runs of the same nucleotide result in a signal that is proportional to the length of the run (Syvanen et al., 1990).

[0118] iv) Extension in Solution

[0119] French Patent 2,650,840 and PCT Application WO91/02087 discuss a solution-based method for determining the identity of the nucleotide of a polymorphic site. According to these methods, a primer complementary to allelic sequences immediately 3′- to a polymorphic site is used. The identity of the nucleotide of that site is determined using labeled dideoxynucleotide derivatives which are incorporated at the end of the primer if complementary to the nucleotide of the polymorphic site.

[0120] v) Genetic Bit Analysis or Solid-Phase Extension

[0121] PCT Application WO92/15712 describes a method that uses mixtures of labeled terminators and a primer that is complementary to the sequence 3′ to a polymorphic site. The labeled terminator that is incorporated is complementary to the nucleotide present in the polymorphic site of the target molecule being evaluated and is thus identified. Here the primer or the target molecule is immobilized to a solid phase.

[0122] vi) Oligonucleotide Ligation Assay (OLA)

[0123] This is another solid phase method that uses different methodology (Landegren et al., 1988). Two oligonucleotides, capable of hybridizing to abutting sequences of a single strand of a target DNA are used. One of these oligonucleotides is biotinylated while the other is detectably labeled. If the precise complementary sequence is found in a target molecule, the oligonucleotides will hybridize such that their termini abut, and create a ligation substrate. Ligation permits the recovery of the labeled oligonucleotide by using avidin. Other nucleic acid detection assays, based on this method, combined with PCR have also been described (Nickerson et al., 1990). Here PCR is used to achieve the exponential amplification of target DNA, which is then detected using the OLA.

[0124] vii) Ligase/Polymerase-Mediated Genetic Bit Analysis

[0125] U.S. Pat. No. 5,952,174 describes a method that also involves two primers capable of hybridizing to abutting sequences of a target molecule. The hybridized product is formed on a solid support to which the target is immobilized. Here the hybridization occurs such that the primers are separated from one another by a space of a single nucleotide. Incubating this hybridized product in the presence of a polymerase, a ligase, and a nucleoside triphosphate mixture containing at least one deoxynucleoside triphosphate allows the ligation of any pair of abutting hybridized oligonucleotides. Addition of a ligase results in two events required to generate a signal, extension and ligation. This provides a higher specificity and lower “noise” than methods using either extension or ligation alone and unlike the polymerase-based assays, this method enhances the specificity of the polymerase step by combining it with a second hybridization and a ligation step for a signal to be attached to the solid phase.

[0126] viii) Other Methods to Detect SNPs

[0127] Several other specific methods for SNP detection and identification are presented below and may be used as such or with suitable modifications in conjunction with identifying polymorphisms of the UGT1A1 genes in the present invention. Several other methods are also described on the SNP web site of the NCBI at the website www.ncbi.nlm.nih.gov/SNP, incorporated herein by reference.

[0128] In a particular embodiment, extended haplotypes may be determined at any given locus in a population, which allows one to identify exactly which SNPs will be redundant and which will be essential in association studies. The latter is referred to as ‘haplotype tag SNPs (htSNPs)’, markers that capture the haplotypes of a gene or a region of linkage disequilibrium. See Johnson et al. (2001) and Ke and Cardon (2003), each of which is incorporated herein by reference, for exemplary methods.

[0129] The VDA-assay utilizes PCR amplification of genomic segments by long PCR methods using TaKaRa LA Taq reagents and other standard reaction conditions. The long amplification can amplify DNA sizes of about 2,000-12,000 bp. Hybridization of products to variant detector array (VDA) can be performed by a Affymetrix High Throughput Screening Center and analyzed with computerized software.

[0130] A method called Chip Assay uses PCR amplification of genomic segments by standard or long PCR protocols. Hybridization products are analyzed by VDA, Halushka et al., 1999, incorporated herein by reference. SNPs are generally classified as “Certain” or “Likely” based on computer analysis of hybridization patterns. By comparison to alternative detection methods such as nucleotide sequencing, “Certain” SNPs have been confirmed 100% of the time; and “Likely” SNPs have been confirmed 73% of the time by this method.

[0131] Other methods simply involve PCR amplification following digestion with the relevant restriction enzyme. Yet others involve sequencing of purified PCR products from known genomic regions.

[0132] In yet another method, individual exons or overlapping fragments of large exons are PCR-amplified. Primers are designed from published or database sequences and PCR-amplification of genomic DNA is performed using the following conditions: 200 ng DNA template, 0.5 μM each primer, 80 μM each of dCTP, dATP, dTTP and dGTP, 5% formamide, 1.5 mM MgCl₂, 0.5 U of Taq polymerase and 0.1 volume of the Taq buffer. Thermal cycling is performed and resulting PCR-products are analyzed by PCR-single strand conformation polymorphism (PCR-SSCP) analysis, under a variety of conditions, e.g, 5 or 10% polyacrylamide gel with 15% urea, with or without 5% glycerol. Electrophoresis is performed overnight. PCR-products that show mobility shifts are reamplified and sequenced to identify nucleotide variation.

[0133] In a method called CGAP-GAI (DEMIGLACE), sequence and alignment data (from a PHRAP.ace file), quality scores for the sequence base calls (from PHRED quality files), distance information (from PHYLIP dnadist and neighbour programs) and base-calling data (from PHRED ‘-d’ switch) are loaded into memory. Sequences are aligned and examined for each vertical chunk (‘slice’) of the resulting assembly for disagreement. Any such slice is considered a candidate SNP (DEMIGLACE). A number of filters are used by DEMIGLACE to eliminate slices that are not likely to represent true polymorphisms. These include filters that: (i) exclude sequences in any given slice from SNP consideration where neighboring sequence quality scores drop 40% or more; (ii) exclude calls in which peak amplitude is below the fifteenth percentile of all base calls for that nucleotide type; (iii) disqualify regions of a sequence having a high number of disagreements with the consensus from participating in SNP calculations; (iv) removed from consideration any base call with an alternative call in which the peak takes up 25% or more of the area of the called peak; (v) exclude variations that occur in only one read direction. PHRED quality scores were converted into probability-of-error values for each nucleotide in the slice. Standard Baysian methods are used to calculate the posterior probability that there is evidence of nucleotide heterogeneity at a given location.

[0134] In a method called CU-RDF (RESEQ), PCR amplification is performed from DNA isolated from blood using specific primers for each SNP, and after typical cleanup protocols to remove unused primers and free nucleotides, direct sequencing using the same or nested primers.

[0135] In a method called DEBNICK (METHOD-B), a comparative analysis of clustered EST sequences is performed and confirmed by fluorescent-based DNA sequencing. In a related method, called DEBNICK (METHOD-C), comparative analysis of clustered EST sequences with phred quality>20 at the site of the mismatch, average phred quality>=20 over 5 bases 5′-FLANK and 3′ to the SNP, no mismatches in 5 bases 5′ and 3′ to the SNP, at least two occurrences of each allele is performed and confirmed by examining traces.

[0136] In a method identified by ERO (RESEQ), new primers sets are designed for electronically published STSs and used to amplify DNA from 10 different mouse strains. The amplification product from each strain is then gel purified and sequenced using a standard dideoxy, cycle sequencing technique with ³³P-labeled terminators. All the ddATP terminated reactions are then loaded in adjacent lanes of a sequencing gel followed by all of the ddGTP reactions and so on. SNPs are identified by visually scanning the radiographs.

[0137] In another method identified as ERO (RESEQ-HT), new primers sets are designed for electronically published murine DNA sequences and used to amplify DNA from 10 different mouse strains. The amplification product from each strain is prepared for sequencing by treating with Exonuclease I and Shrimp Alkaline Phosphatase. Sequencing is performed using ABI Prism Big Dye Terminator Ready Reaction Kit (Perkin-Elmer) and sequence samples are run on the 3700 DNA Analyzer (96 Capillary Sequencer).

[0138] FGU-CBT (SCA2-SNP) identifies a method where the region containing the SNP is PCR amplified using the primers SCA2-FP3 and SCA2-RP3. Approximately 100 ng of genomic DNA is amplified in a 50 ml reaction volume containing a final concentration of 5 mM Tris, 25 mM KCl, 0.75 mM MgCl₂, 0.05% gelatin, 20 pmol of each primer and 0.5 U of Taq DNA polymerase. Samples are denatured, annealed and extended and the PCR product is purified from a band cut out of the agarose gel using, for example, the QIAquick gel extraction kit (Qiagen) and is sequenced using dye terminator chemistry on an ABI Prism 377 automated DNA sequencer with the PCR primers.

[0139] In a method identified as JBLACK (SEQ/RESTRICT), two independent PCR reactions are performed with genomic DNA. Products from the first reaction are analyzed by sequencing, indicating a unique FspI restriction site. The mutation is confirmed in the product of the second PCR reaction by digesting with Fsp I.

[0140] In a method described as KWOK(1), SNPs are identified by comparing high quality genomic sequence data from four randomly chosen individuals by direct DNA sequencing of PCR products with dye-terminator chemistry (see Kwok et al., 1996). In a related method identified as KWOK (2) SNPs) are identified by comparing high quality genomic sequence data from overlapping large-insert clones such as bacterial artificial chromosomes (BACs) or P1-based artificial chromosomes (PACs). An STS containing this SNP is then developed and the existence of the SNP in various populations is confirmed by pooled DNA sequencing (see Taillon-Miller et al., 1998). In another similar method called KWOK(3), SNPs are identified by comparing high quality genomic sequence data from overlapping large-insert clones BACs or PACs. The SNPs found by this approach represent DNA sequence variations between the two donor chromosomes but the allele frequencies in the general population have not yet been determined. In method KWOK(5), SNPs are identified by comparing high quality genomic sequence data from a homozygous DNA sample and one or more pooled DNA samples by direct DNA sequencing of PCR products with dye-terminator chemistry. The STSs used are developed from sequence data found in publicly available databases. Specifically, these STSs are amplified by PCR against a complete hydatidiform mole (CHM) that has been shown to be homozygous at all loci and a pool of DNA samples from 80 CEPH parents (see Kwok et al., 1994).

[0141] In another such method, KWOK (OverlapSnpDetectionWithPolyBayes), SNPs are discovered by automated computer analysis of overlapping regions of large-insert human genomic clone sequences. For data acquisition, clone sequences are obtained directly from large-scale sequencing centers. This is necessary because base quality sequences are not present/available through GenBank. Raw data processing involves analyzed of clone sequences and accompanying base quality information for consistency. Finished (‘base perfect’, error rate lower than 1 in 10,000 bp) sequences with no associated base quality sequences are assigned a uniform base quality value of 40 (1 in 10,000 bp error rate). Draft sequences without base quality values are rejected. Processed sequences are entered into a local database. A version of each sequence with known human repeats masked is also stored. Repeat masking is performed with the program “MASKERAID.” Overlap detection: Putative overlaps are detected with the program “WUBLAST.” Several filtering steps followed in order to eliminate false overlap detection results, i.e. similarities between a pair of clone sequences that arise due to sequence duplication as opposed to true overlap. Total length of overlap, overall percent similarity, number of sequence differences between nucleotides with high base quality value “high-quality mismatches.” Results are also compared to results of restriction fragment mapping of genomic clones at Washington University Genome Sequencing Center, finisher's reports on overlaps, and results of the sequence contig building effort at the NCBI. SNP detection: Overlapping pairs of clone sequence are analyzed for candidate SNP sites with the ‘POLYBAYES’ SNP detection software. Sequence differences between the pair of sequences are scored for the probability of representing true sequence variation as opposed to sequencing error. This process requires the presence of base quality values for both sequences. High-scoring candidates are extracted. The search is restricted to substitution-type single base pair variations. Confidence score of candidate SNP is computed by the POLYBAYES software.

[0142] In method identified by KWOK (TaqMan assay), the TaqMan assay is used to determine genotypes for 90 random individuals. In method identified by KYUGEN(Q1), DNA samples of indicated populations are pooled and analyzed by PLACE-SSCP. Peak heights of each allele in the pooled analysis are corrected by those in a heterozygote, and are subsequently used for calculation of allele frequencies. Allele frequencies higher than 10% are reliably quantified by this method. Allele frequency=0 (zero) means that the allele was found among individuals, but the corresponding peak is not seen in the examination of pool. Allele frequency=0-0.1 indicates that minor alleles are detected in the pool but the peaks are too low to reliably quantify.

[0143] In yet another method identified as KYUGEN (Method1), PCR products are post-labeled with fluorescent dyes and analyzed by an automated capillary electrophoresis system under SSCP conditions (PLACE-SSCP). Four or more individual DNAs are analyzed with or without two pooled DNA (Japanese pool and CEPH parents pool) in a series of experiments. Alleles are identified by visual inspection. Individual DNAs with different genotypes are sequenced and SNPs identified. Allele frequencies are estimated from peak heights in the pooled samples after correction of signal bias using peak heights in heterozygotes. For the PCR primers are tagged to have 5′-ATT or 5′-GTT at their ends for post-labeling of both strands. Samples of DNA (10 ng/ul) are amplified in reaction mixtures containing the buffer (10 mM Tris-HCl, pH 8.3 or 9.3, 50 mM KCl, 2.0 mM MgCl₂), 0.25 μM of each primer, 200 μM of each dNTP, and 0.025 units/μl of Taq DNA polymerase premixed with anti-Taq antibody. The two strands of PCR products are differentially labeled with nucleotides modified with R110 and R6G by an exchange reaction of Klenow fragment of DNA polymerase I. The reaction is stopped by adding EDTA, and unincorporated nucleotides are dephosphorylated by adding calf intestinal alkaline phosphatase. For the SSCP: an aliquot of fluorescently labeled PCR products and TAMRA-labeled internal markers are added to deionized formamide, and denatured. Electrophoresis is performed in a capillary using an ABI Prism 310 Genetic Analyzer. Genescan softwares (P-E Biosystems) are used for data collection and data processing. DNA of individuals (two to eleven) including those who showed different genotypes on SSCP are subjected for direct sequencing using big-dye terminator chemistry, on ABI Prism 310 sequencers. Multiple sequence trace files obtained from ABI Prism 310 are processed and aligned by Phred/Phrap and viewed using Consed viewer. SNPs are identified by PolyPhred software and visual inspection.

[0144] In yet another method identified as KYUGEN (Method2), individuals with different genotypes are searched by denaturing HPLC (DHPLC) or PLACE-SSCP (Inazuka et al., 1997) and their sequences are determined to identify SNPs. PCR is performed with primers tagged with 5′-ATT or 5′-GTT at their ends for post-labeling of both strands. DHPLC analysis is carried out using the WAVE DNA fragment analysis system (Transgenomic). PCR products are injected into DNASep column, and separated under the conditions determined using WAVEMaker program (Transgenomic). The two strands of PCR products that are differentially labeled with nucleotides modified with R110 and R6G by an exchange reaction of Klenow fragment of DNA polymerase I. The reaction is stopped by adding EDTA, and unincorporated nucleotides are dephosphorylated by adding calf intestinal alkaline phosphatase. SSCP followed by electrophoresis is performed in a capillary using an ABI Prism 310 Genetic Analyzer. Genescan softwares (P-E Biosystems). DNA of individuals including those who showed different genotypes on DHPLC or SSCP are subjected for direct sequencing using big-dye terminator chemistry, on ABI Prism 310 sequencer. Multiple sequence trace files obtained from ABI Prism 310 are processed and aligned by Phred/Phrap and viewed using Consed viewer. SNPs are identified by PolyPhred software and visual inspection. Trace chromatogram data of EST sequences in Unigene are processed with PHRED. To identify likely SNPs, single base mismatches are reported from multiple sequence alignments produced by the programs PHRAP, BRO and POA for each Unigene cluster. BRO corrected possible misreported EST orientations, while POA identified and analyzed non-linear alignment structures indicative of gene mixing/chimeras that might produce spurious SNPs. Bayesian inference is used to weigh evidence for true polymorphism versus sequencing error, misalignment or ambiguity, misclustering or chimeric EST sequences, assessing data such as raw chromatogram height, sharpness, overlap and spacing; sequencing error rates; context-sensitivity; cDNA library origin, etc.

[0145] In method identified as MARSHFIELD (Method-B), overlapping human DNA sequences which contained putative insertion/deletion polymorphisms are identified through searches of public databases. PCR primers which flanked each polymorphic site are selected from the consensus sequences. Primers are used to amplify individual or pooled human genomic DNA. Resulting PCR products are resolved on a denaturing polyacrylamide gel and a PhosphorImager is used to estimate allele frequencies from DNA pools.

[0146] IV. Pharmaceutical Compositions

[0147] Aqueous compositions may have an effective amount of irinotecan and/or an effective amount of a compound (second agent) that increases conjugative enzyme activity, as represented by a compound that increases the activity of the phase II conjugative enzyme, glucuronosyltransferase or that decreases biliary transport. Such compositions will generally be dissolved or dispersed in a pharmaceutically acceptable carrier or aqueous medium.

[0148] The phrases “pharmaceutically or pharmacologically acceptable” refer to molecular entities and compositions that do not produce an adverse, allergic or other untoward reaction when administered to an animal, or human, as appropriate. As used herein, “pharmaceutically acceptable carrier” includes any and all solvents, dispersion media, coatings, antibacterial and antifungal agents, isotonic and absorption delaying agents and the like. The use of such media and agents for pharmaceutical active substances is well known in the art. Except insofar as any conventional media or agent is incompatible with the active ingredients, its use in the therapeutic compositions is contemplated. Supplementary active ingredients, such as other anti-cancer agents, can also be incorporated into the compositions.

[0149] In addition to the compounds formulated for parenteral administration, such as intravenous or intramuscular injection, other pharmaceutically acceptable forms include, e.g., tablets or other solids for oral administration; time release capsules; and any other form currently used, including cremes, lotions, mouthwashes, inhalants and the like.

[0150] A. Parenteral Administration

[0151] The active compounds will often be formulated for parenteral administration, e.g., formulated for injection via the intravenous, intramuscular, sub-cutaneous, or even intraperitoneal routes. The preparation of an aqueous composition that contains irinotecan and a second agent as active ingredients will be known to those of skill in the art in light of the present disclosure. Typically, such compositions can be prepared as injectables, either as liquid solutions or suspensions; solid forms suitable for using to prepare solutions or suspensions upon the addition of a liquid prior to injection can also be prepared; and the preparations can also be emulsified.

[0152] Solutions of the active compounds as free base or pharmacologically acceptable salts can be prepared in water suitably mixed with a surfactant, such as hydroxypropylcellulose. Dispersions can also be prepared in glycerol, liquid polyethylene glycols, and mixtures thereof and in oils. Under ordinary conditions of storage and use, these preparations contain a preservative to prevent the growth of microorganisms.

[0153] The pharmaceutical forms suitable for injectable use include sterile aqueous solutions or dispersions; formulations including sesame oil, peanut oil or aqueous propylene glycol; and sterile powders for the extemporaneous preparation of sterile injectable solutions or dispersions. In all cases the form must be sterile and must be fluid to the extent that easy syringability exists. It must be stable under the conditions of manufacture and storage and must be preserved against the contaminating action of microorganisms, such as bacteria and fungi.

[0154] The active compounds may be formulated into a composition in a neutral or salt form. Pharmaceutically acceptable salts, include the acid addition salts (formed with the free amino groups of the protein) and which are formed with inorganic acids such as, for example, hydrochloric or phosphoric acids, or such organic acids as acetic, oxalic, tartaric, mandelic, and the like. Salts formed with the free carboxyl groups can also be derived from inorganic bases such as, for example, sodium, potassium, ammonium, calcium, or ferric hydroxides, and such organic bases as isopropylamine, trimethylamine, histidine, procaine and the like.

[0155] The carrier can also be a solvent or dispersion medium containing, for example, water, ethanol, polyol (for example, glycerol, propylene glycol, and liquid polyethylene glycol, and the like), suitable mixtures thereof, and vegetable oils. The proper fluidity can be maintained, for example, by the use of a coating, such as lecithin, by the maintenance of the required particle size in the case of dispersion and by the use of surfactants. The prevention of the action of microorganisms can be brought about by various antibacterial ad antifungal agents, for example, parabens, chlorobutanol, phenol, sorbic acid, thimerosal, and the like. In many cases, it will be preferable to include isotonic agents, for example, sugars or sodium chloride. Prolonged absorption of the injectable compositions can be brought about by the use in the compositions of agents delaying absorption, for example, aluminum monostearate and gelatin.

[0156] Sterile injectable solutions are prepared by incorporating the active compounds in the required amount in the appropriate solvent with various of the other ingredients enumerated above, as required, followed by filtered sterilization. Generally, dispersions are prepared by incorporating the various sterilized active ingredients into a sterile vehicle which contains the basic dispersion medium and the required other ingredients from those enumerated above. In the case of sterile powders for the preparation of sterile injectable solutions, the preferred methods of preparation are vacuum-drying and freeze-drying techniques which yield a powder of the active ingredient plus any additional desired ingredient from a previously sterile-filtered solution thereof.

[0157] Upon formulation, solutions will be administered in a manner compatible with the dosage formulation and in such amount as is therapeutically effective. The formulations are easily administered in a variety of dosage forms, such as the type of injectable solutions described above, with even drug release capsules and the like being employable.

[0158] For parenteral administration in an aqueous solution, for example, the solution should be suitably buffered if necessary and the liquid diluent first rendered isotonic with sufficient saline or glucose. These particular aqueous solutions are especially suitable for intravenous, intramuscular, subcutaneous and intraperitoneal administration. In this connection, sterile aqueous media which can be employed will be known to those of skill in the art in light of the present disclosure. For example, one dosage could be dissolved in 1 mL of isotonic NaCl solution and either added to 1000 mL of hypodermoclysis fluid or injected at the proposed site of infusion, (see for example, “Remington's Pharmaceutical Sciences” 15th Edition, pages 1035-1038 and 1570-1580). Some variation in dosage will necessarily occur depending on the condition of the subject being treated. The person responsible for administration will, in any event, determine the appropriate dose for the individual subject.

[0159] B. Oral Administration

[0160] In certain embodiments, active compounds may be administered orally. This is contemplated for agents which are generally resistant, or have been rendered resistant, to proteolysis by digestive enzymes. Such compounds are contemplated to include all those compounds, or drugs, that are available in tablet form from the manufacturer and derivatives and analogues thereof.

[0161] For oral administration, the active compounds may be administered, for example, with an inert diluent or with an assimilable edible carrier, or they may be enclosed in hard or soft shell gelatin capsule, or compressed into tablets, or incorporated directly with the food of the diet. For oral therapeutic administration, the active compounds may be incorporated with excipients and used in the form of ingestible tablets, buccal tables, troches, capsules, elixirs, suspensions, syrups, wafers, and the like. Such compositions and preparations should contain at least 0.1% of active compound. The percentage of the compositions and preparations may, of course, be varied and may conveniently be between about 2 to about 60% of the weight of the unit. The amount of active compounds in such therapeutically useful compositions is such that a suitable dosage will be obtained.

[0162] The tablets, troches, pills, capsules and the like may also contain the following: a binder, as gum tragacanth, acacia, cornstarch, or gelatin; excipients, such as dicalcium phosphate; a disintegrating agent, such as corn starch, potato starch, alginic acid and the like; a lubricant, such as magnesium stearate; and a sweetening agent, such as sucrose, lactose or saccharin may be added or a flavoring agent, such as peppermint, oil of wintergreen, or cherry flavoring. When the dosage unit form is a capsule, it may contain, in addition to materials of the above type, a liquid carrier. Various other materials may be present as coatings or to otherwise modify the physical form of the dosage unit. For instance, tablets, pills, or capsules may be coated with shellac, sugar or both. A syrup of elixir may contain the active compounds sucrose as a sweetening agent methyl and propylparabens as preservatives, a dye and flavoring, such as cherry or orange flavor. Of course, any material used in preparing any dosage unit form should be pharmaceutically pure and substantially non-toxic in the amounts employed. In addition, the active compounds may be incorporated into sustained-release preparation and formulations.

[0163] Upon formulation, the compounds will be administered in a manner compatible with the dosage formulation and in such amount as is therapeutically effective. The formulations are easily administered in a variety of dosage forms, such as those described below in specific examples.

[0164] C. Liposomes

[0165] In a particular embodiment, liposomal formulations are contemplated. Liposomal encapsulation of pharmaceutical agents prolongs their half-lives when compared to conventional drug delivery systems. Because larger quantities can be protectively packaged, this allow the opportunity for dose-intensity of agents so delivered to cells. This would be particularly attractive in the chemotherapy of cervical cancer if there were mechanisms to specifically enhance the cellular targeting of such liposomes to these cells.

[0166] “Liposome” is a generic term encompassing a variety of single and multilamellar lipid vehicles formed by the generation of enclosed lipid bilayers. Phospholipids are used for preparing the liposomes according to the present invention and can carry a net positive charge, a net negative charge or are neutral. Dicetyl phosphate can be employed to confer a negative charge on the liposomes, and stearylamine can be used to confer a positive charge on the liposomes. Liposomes are characterized by a phospholipid bilayer membrane and an inner aqueous medium. Multilamellar liposomes have multiple lipid layers separated by aqueous medium. They form spontaneously when phospholipids are suspended in an excess of aqueous solution. The lipid components undergo self-rearrangement before the formation of closed structures and entrap water and dissolved solutes between the lipid bilayers (Ghosh and Bachhawat, 1991). Also contemplated are cationic lipid-nucleic acid complexes, such as lipofectamine-nucleic acid complexes

[0167] V. Kits

[0168] Any of the compositions described herein may be comprised in a kit. In a non-limiting example, reagents for determining the genotype of one or both UGT1A1 genes are included in a kit. The kit may further include individual nucleic acids that can be amplify and/or detect particular nucleic acid sequences the UGT1A1 gene. It may also include one or more buffers, such as a DNA isolation bufffers, an amplification buffer or a hybridization buffer. The kit may also contain compounds and reagents to prepare DNA templates and isolate DNA from a sample. The kit may also include various labeling reagents and compounds.

[0169] The components of the kits may be packaged either in aqueous media or in lyophilized form. The container means of the kits will generally include at least one vial, test tube, flask, bottle, syringe or other container means, into which a component may be placed, and preferably, suitably aliquoted. Where there are more than one component in the kit (labeling reagent and label may be packaged together), the kit also will generally contain a second, third or other additional container into which the additional components may be separately placed. However, various combinations of components may be comprised in a vial. The kits of the present invention also will typically include a means for containing the nucleic acids, and any other reagent containers in close confinement for commercial sale. Such containers may include injection or blow-molded plastic containers into which the desired vials are retained.

[0170] When the components of the kit are provided in one and/or more liquid solutions, the liquid solution is an aqueous solution, with a sterile aqueous solution being particularly preferred. However, the components of the kit may be provided as dried powder(s). When reagents and/or components are provided as a dry powder, the powder can be reconstituted by the addition of a suitable solvent. It is envisioned that the solvent may also be provided in another container means.

[0171] A kit will also include instructions for employing the kit components as well the use of any other reagent not included in the kit. Instructions may include variations that can be implemented.

[0172] It is contemplated that such reagents are embodiments of kits of the invention. Such kits, however, are not limited to the particular items identified above and may include any reagent used directly or indirectly in the detection of polymorphisms in the UGT1A1 gene or the activity level of the UGT1A1 polypeptide.

EXAMPLES

[0173] The following examples are included to demonstrate preferred embodiments of the invention. It should be appreciated by those of skill in the art that the techniques disclosed in the examples which follow represent techniques discovered by the inventor to function well in the practice of the invention, and thus can be considered to constitute preferred modes for its practice. However, those of skill in the art should, in light of the present disclosure, appreciate that many changes can be made in the specific embodiments which are disclosed and still obtain a like or similar result without departing from the spirit and scope of the invention.

Example 1 Materials and Methods for Examples 2-5

[0174] Chemicals and Reagents

[0175] Exonuclease I and shrimp alkaline phosphatase (exo/SAP) was purchased from USB (Cleveland, Ohio, USA). ABI Big Dye terminator cycle-sequencing kit was purchased from Applied Biosystems (Foster City, Calif., USA). Primers for amplification, sequencing of the PBREM, and amplification of the (TA), polymorphism were obtained from GibcoBRL (Invitrogen Co., Carlsbad, Calif., USA). SN-38 was kindly provided by Dr Kiyoshi Terada (Yakult Honsha Co., Ltd, Japan). Camptothecin, UDPGA, magnesium chloride, trizma base, potassium monohydrogen phosphate and 1-heptanesulfonic acid were purchased from Sigma-Aldrich (St. Louis, Mo., USA). Acetonitrile, tetrahydrofuran and hydrochloric acid were obtained from Fisher Scientific (Hanover, Ill., USA).

[0176] Human Livers

[0177] Normal human livers (n=83) were mainly obtained from Liver Tissue Procurement and Distribution System (National Institutes of Diabetes and Digestive and Kidney Diseases, Minneapolis, Minn.). DNA was isolated by using Qiagen RNA/DNA Maxi Kit (Qiagen Inc., Valencia, Calif., USA), and microsomes were isolated following differential centrifugation methods (Purba et al., 1987). DNA and microsomes were provided by the Liver Core Bank Facility (St. Jude Children's Research Hospital) of the Pharmacogenetics of Anticancer Agents Research (PAAR) Group. In order to identify livers in which enzyme degradation occurred, liver samples consistently comprised in the 10th percentile of UGT1A1, UGT1A9 and UGT2B7 activities were sought. UGT1A9 and UGT2B7 activities were measured using specific probes (data not shown) (Ramirez et al., 2002 and Innocenti, et al., 2001). Out of eight samples within the 10th percentile of UGT1A1, only one sample was comprised within the 10th percentile of activities of the other two enzyme activities. If different handling/storage of the liver or microsomal protein degradation occurred in that sample, this should not have affected the degree of phenotype/genotype correlation because the individual had a 7/7 genotype, and among the 7/7 genotype samples (n=11), it had the 4th lower value. Moreover, lack of correlation between UGT1A1 and UGT2B7 activities (n=83, r=0.07, P=0.5) shows that differences in tissue integrity and microsome stability have probably a mild influence (if any) on the UGT phenotype.

[0178] The ethnic composition of the 83 liver donors comprised: Caucasians 68%, African-Americans 18%, Asians 1%, others 2%. The percentage of samples of unknown ethnic origin was 12%.

[0179] Genotyping of (TA)_(n) Polymorphism

[0180] In order to genotype the (TA)_(n) polymorphism, approximately 40 ng of DNA was subjected to amplification by polymerase chain reaction (PCR). The amplification primers used have been previously described (Monaghan et al., 1996), where the sequence of the forward primer is 5′-GTCACGTGACACAGTCAAAC-3′ (SEQ ID NO:2) and that of the reverse primer is 5′-TTTGCTCCTGCCAGAGGTT-3′ (SEQ ID NO:3). These primers flank the polymorphic TA locus in the promoter region of the UGT1A1 gene and amplify a 98 bp fragment when a (TA)₆ allele is present and a 100 bp fragment when a (TA)₇ allele is present. In the presence of (TA)₅ and (TA)₈ alleles, 96 bp and 102 bp alleles are amplified. The reverse primer is labeled with a fluorescent dye at its 5′-end to permit visualization of the amplification product. The amplification reactions were performed in a 10 μl volume consisting of 1.5 mmol MgCl₂, 250 mmol dNTPs, 0.8 mmol of each primer and 0.5 U of Taq polymerase (Amplitaq Gold from Applied Biosystems). The polymerase was activated at 95° C. for 10 min and DNA amplified for 35 cycles at 95° C. for 30 sec, 55° C. for 30 sec and 72° C. for 30 sec, followed by a final extension at 72° C. for 10 min. Control DNAs from individuals known to have a 6/6, 6/7 and 7/7 genotype were included in the PCR analysis. PCR fragments were subjected to gel electrophoresis on an ABI 377 DNA analyzer (Applied Biosystems). Amplified products were diluted in a formamide and dextran blue loading buffer and 1 μl combined with 1 μl of size standard (GS-350 from Applied Biosystems), denatured at 95° C., and loaded onto a 6% denaturing polyacrylamide gel. Electrophoresis was performed for 3.5 hours following manufacturers recommendations. The Genescan and Genotyper software (version 3.7, Applied Biosystems) was used to analyze fragments for size determination.

[0181] Sequencing of PBREM

[0182] A 606 bp region (−3641 to −3036) including PBREM was successfully PCR-amplified and sequenced in 81 of the 83 human liver DNAs and 22 of the 24 DNA samples from African-American individuals (Americans of African descent, born in the USA) included in the NIGMS HGCR Human VariationPanel (Coriell Institute for Medical Research, Camden, N.J., USA). The reference sequence shown in FIG. 1 is that deposited in the GenBank database (accession number AF313454). Amplification of the PCR product was performed in a 10 or 25-pl reaction volume using the following primers: 5′-CTGGGGATAAACATGGGATG-3′ SEQ ID NO:4 (forward) and 5′-CACCACCACTTCTGGAACCT-3′ SEQ ID NO:5 (reverse). These primers were designed using Primer3 software (Rozen et al., 1998). PCR conditions were 2 min at 94° C., 32 or 33 cycles of a three-step cycling program (30 sec at 94° C., 30 sec at 66.8° C. and 1 min at 72° C.) and 72° C. for 3 min. Following exo/SAP cleanup of the PCR product, this amplicon was then sequenced in the forward and reverse directions using the amplification primers, Big Dye terminator chemistry, and run on an ABI 3700 (Applied Biosystems) following the manufacturer's protocol. Sequences were analyzed and individuals genotyped using the Poly-Phred software (Nickerson et al., 1997). To determine the ancestral state of the polymorphisms found in humans, the sequence was compared to that of baboon (accession number AC091778).

[0183] SN-38 Glucuronidation Assay in Human Liver Microsomes

[0184] Samples were phenotyped by using SN-38 as a substrate for UGT1A1. The incubation mixture consisted of 5 μmol SN-38, 10 mmol MgCl₂, 1 mg/ml microsomes, 0.025 mol Tris-HCl (pH 7.4) and 5 mmol UDP-GA. Samples were incubated for 30 min at 37° C. The reaction was stopped by the addition of methanol. These conditions were selected after previous optimization of the enzyme reaction (Iyer et al., 1998). Camptothecin (75 ng) was used as an internal standard. SN-38 glucuronidation was measured by HPLC (Hitachi Instruments Inc., San Jose, Calif., USA) with fluorescence detection (λ excitation=355 nm, λ emission=515 nm). A μBondapak™ C₁₈ column (3.9×300 mm, 10 μm; Waters Corp., Milford, Mass., USA) and μBondapak™ C₁₈ guardpak (Waters Corp.) were used. A mobile phase of Aug. 4, 1988 acetonitrile/tetra-hydrofuran/0.9 mmol sodium heptanedfonic acid in 50 mmol potassium dihydrogen phosphate (pH 4) was used during the first 7 min of the run. From 7.1-25 min, the eluent consisted of 30/70 acetonitrile/5 mmol sodium heptanesulfonic acid in 50 mmol potassium dihydrogen phosphate (pH 4). The flow rate was 0.9 ml/min. Retention times for SN-38G, SN-38 and camptothecin were 13.3, 18.4 and 19.3 min, respectively. SN-38 glucuronidation rates were reported as the ratios between SN-38 glucuronide (SN-38G) and internal standard (IS) peak heights. The intra-assay variability was determined by performing 10 incubations on the same day using a pool of human liver microsomes. The inter-assay variability was evaluated by incubating a pool of human liver microsomes in triplicate on three different days. The inter- and intra-assay variabilities were within 7%.

[0185] Statistical Analysis

[0186] The significance of linkage disequilibrium between pairs of polymorphic sites was assessed using genotypic data and a likelihood ratio test provided in ARLEQUIN, version 2 (Schneider et al., 2000). ARLEQUIN was also used to run a modified Markov-chain random walk algorithm to test for Hardy-Weinberg equilibrium. Next, multisite haplotypes were estimated using the program PHASE (Stephens et al., 2001). Because this program does not accept both bi-allelic and multi-allelic polymorphic sites, haplotypes were estimated only for individuals with either the (TA)₆ or (TA)₇ alleles.

[0187] Thirteen individuals were heterozygous for the (TA)₅ or (TA)₈ repeat, three of which were heterozygous only at the TA repeat and therefore unambiguous at the other sites. For the remaining 10 individuals, haplotypes were determined manually by assuming that the chromosome with the (TA)₆ or (TA)₇ allele contained a haplotype previously identified by the PHASE analysis. In one case, this method would have resulted in a new (TA)₈ haplotype. However, it is more likely that this individual would instead have a novel (TA)₆ haplotype (V), which is consistent with the observation that the (TA)₆ allele is found on multiple haplotypes, including other rare ones. An incorrect assignment would have little or no affect on the subsequent analyses because the novel haplotype only occurs once out of 103 individuals and not in a sample used in studies of correlation with phenotype.

[0188] The effective number of haplotypes was calculated as the reciprocal of the sum of the frequency squared. Diversity in (TA)₆ haplotypes in Caucasians and African-Americans, based on the numbers and frequencies of haplotypes and adjusted by sample size, was estimated by DnaSP version 3.53 (Rozas et al.) as well as their SDs. Statistical significance was assessed using a t-test as previously described (Nei, 1987). The chi-square test was used to analyze the differences in genotype/haplotype frequencies between Caucasians and African-Americans.

[0189] UGT1A1 activity was phenotyped by measuring SN-38 glucuronidation rates of each liver as the mean±SD of a single experiment performed in triplicate. The statistical analysis of the relationship between the (TA)_(n) polymorphism and phenotype was planned to assess first the genotype effect on phenotype in the population sample (n=83) using the analysis of variance (ANOVA). If the genotype effect was statistically significant then, within each ethnic group, a test of trend across the genotypes was performed using the exact Jonkheerer-Terpstra (JT) test (Gibbons et al., 1992). Pairwise comparisons between two genotypes were performed using an exact one-sided Wilcoxon test. Moreover, trend analysis and pairwise comparisons were performed in genotypes expressed as the sum of TA repeats in both chromosomes (i.e. in samples with <12 (5/6, 6/6, 5/7), 13 (6/7) and ≦14 (7/7, 6/8, 7/8) TA repeat genotypes). Concerning the haplotype-phenotype relationship, two-sided exact Wilcoxon tests were used to compare the SN-38 glucuronidation rates between two haplotypes. SAS system (SAS Institute, Inc., Cary, N.C.) and StatXact-5 (CYTEL Software Corporation, Cambridge, Mass., USA) were used for statistical analysis. GraphPad software version 3.02 (GraphPad Software Inc., San Diego, Calif., USA) was used for graphical analysis.

Example 2 Genotyping of (TA)_(n) Polymorphism

[0190] The (TA)₆ allele was the most common allele with a frequency of 0.58 while the (TA)₇ allele had a frequency of 0.36 (Table 1). (TA)₅ and (TA)₈ alleles were also found, although at lower frequencies (0.02 and 0.05, respectively). In the population sample (n=107), the most common genotype was 6/7 (0.41), followed by the 6/6 genotype (0.34). Rare genotypes (<0.02) included 5/6, 5/7 and 5/8 genotypes. The (TA)₆ and (TA)₇ allele frequencies were not significantly different between Caucasians and African-Americans (chi-square test, P=0.7). Similarly, 6/6, 6/7, and 7/7 genotype frequencies were not different between the two ethnic groups (chi-square test, P=0.8). One Asian individual had a 6/6 genotype, while two individuals with other ethnicities had 6/7 and 7/7 genotypes. TABLE 1 (TA)_(n) polymorphism: genotype frequencies (TA)_(n) 5/6 5/7 5/8 6/6 6/7 6/8 7/7 7/8 Population 0.01 0.02 0.01 0.34 0.41 0.06 0.13 .03 Sample (n = 107) Caucasians 0.02 0 0 0.38 0.46 0 0.13 0.02 (n = 56) African- 0 0.05 0.03 0.26 0.33 0.15 0.13 0.05 Americans (n = 39

Example 3 Sequencing of PBREM

[0191] In 103 samples, six polymorphisms were found, and two of them (−3279G>T and −3156G>A) are common, with frequencies of 0.39 and 0.30, respectively (FIG. 1, Table 2). All six polymorphisms are in Hardy-Weinberg equilibrium (P>0.5). Based upon comparisons to the baboon sequence (accession number AC091778, which is incorporated herein by reference), it is likely that −3279G and −31566 are the ancestral states. The most common −3279G>T polymorphism is located in the spacer sequence of the NR3 domain of PBREM (FIG. 1). No variants were found in the gtNR1 domain, the binding site for constitutive active receptor (CAR). −3279G was significantly more common among African-Americans compared to Caucasians (chi squared=13.82, P=0.001) while the frequency of −3156A did not significantly differ between the two ethnic groups (chi-square test, P=0.9). TABLE 2 Sequencing of PBREM: genotype frequencies Position −3440 −3401 −3279 −3177 −3175 −3156 Genotype CC CA TT TC GG GT TT CC CG AA AG GG GA AA Population 0.96 0.04 0.99 0.01 0.38 0.46 0.16 0.99 0.01 0.99 0.01 0.49 0.43 0.08 Sample (n = 103) Caucasian 0.96 0.04 1 0 0.18 0.58 0.24 1 0 1 0 0.47 0.44 0.09 (n = 55) African- 0.95 0.05 0.97 0.03 0.73 0.24 0.03 0.97 0.03 0.97 0.03 0.51 0.41 0.06 Americans (n = 37)

Example 4 Linkage Disequilibrium and Haplotype Structure of the UGT1A1 Promoter

[0192] A likelihood ratio test detected significant pairwise linkage disequilibrium between sites −3279, −3156 and the (TA)_(n) polymorphism in our population sample (n=103, P<0.0001). When only the common (TA)₆ and (TA)₇ alleles were used for the linkage disequilibrium analysis, the same results were obtained (P<0.0001). When pairwise linkage disequilibrium was separately assessed in Caucasians and African-Americans, highly significant linkage disequilibrium was similarly detected in Caucasians (P<0.0001). In African-Americans, pairwise linkage disequilibrium was also detected between all sites, however, the level of significance varied greatly between the pairwise comparisons. Only linkage disequilibrium between (TA)_(n) and −3156 had significance levels similar to those seen for Caucasians (P<0.0005) while linkage disequilibrium had only low levels of significance between (TA)_(n) and −3279 (P=0.02) and between −3279 and −3156 (P=0.04).

[0193] Multisite haplotype inference resulted in 10 haplotypes spanning the PBREM variants and the (TA)_(n), polymorphism (Table 3). Haplotypes I-V include the (TA)₆ allele, and haplotype I differs from haplotype II at position −3279 in the NR3 domain of PBREM. Haplotypes VI, VII and VIII include the (TA)₇ repeat and haplotypes VI and VII differ from each other at position −3156. There is a suggestion that the haplotype structure of the (TA)₆ allele is different in the African-American subsample. Compared to Caucasians, haplotype I is less common in African-Americans (chi squared=27.06, P<0.0001), while haplotype II is more common (chi squared=14.84, P=0.0001). Differences in haplotype VI and VII frequencies were not statistically significant between the two groups (chi-square test, P=0.44 and 0.48, respectively).

[0194] Among the samples examined, 21 different combinations of these haplotypes were found. In Caucasians, the most frequent haplotype pairs are I/VI (0.35), I/I (0.24) and I/II (0.11), while in African-Americans, they are I/II (0.11), II/VI (0.11), II/VIII (0.08), I/VI (0.08), II/VII (0.08) and VI/VI (0.08). The effective numbers of haplotypes, which reflect how many relatively high frequency haplotypes are observed, were 5.2 and 2.6 in African-Americans and Caucasians, respectively (Table 3). Finally, diversity (+SD) of (TA)₆ haplotypes was 0.555±0.070 and 0.262±0.065 in African-Americans and Caucasians, respectively (P<0.05). TABLE 3 Haplotype structure of promoter mutations and haplotype frequencies. The effective number of haplotypes is also reported. Population African- Sample Caucasians Americans Haplotype −3440 −3401 −3279 −3177 −3175 −3156 (TA)_(n) (n = 103) (n = 55) (n = 37) I C T T C A G (TA)₆ 0.39 0.53 0.15 II C T G C A G (TA)₆ 0.15 0.07 0.28 III A T G C A G (TA)₆ 0.02 0.02 0.03 IV C C G C A G (TA)₆ 0.01 0 0.01 V C T G C A A (TA)₆ 0.01 0 0.01 VI C T G C A A (TA)₇ 0.29 0.31 0.28 VII C T G C A G (TA)₇ 0.07 0.05 0.08 VIII C T G G G A (TA)₇ 0.01 0 0.01 IX C T G C A G (TA)₈ 0.05 0.01 0.12 X C T G C A G (TA)₅ 0.02 0.01 0.04 3.7^(a) 2.6^(a) 5.2^(a)

Example 5 UGT1A1 Phenotyping and Association with (TA)_(n) Polymorphism and Haplotypes

[0195] UGT1A1 activity was measured as SN-38 glucuronidation rates in 83 human liver microsomes. A 46% coefficient of variation (1.90 f 0.87 SN-38G/IS, mean f SD) and a 10-fold range in SN-38 glucuronidation were observed.

[0196] Because of the small number of subjects in the 5/7, 5/6, 6/8 and 7/8 genotypes, only 6/6, 6/7 and 7/7 were used in the ANOVA analysis. The phenotype was significantly different across these three genotypes (P=0.008) (FIG. 2a). The degree of variation of the SN-38 glucuronidation rate across the genotypes was similar in different ethnic groups (P>0.1). A significantly decreasing trend was shown across the 6/6, 6/7 and 7/7 genotypes in Caucasians (P<0.001, JT test, FIG. 2b) and across the 6/6, 6/7, 6/8 and 7/7 genotypes in African-Americans (P=0.033, JT test) (FIG. 2c). When samples with Asian (n=1), other (n=2) and unknown (n=10) ethnic background were pooled together, no significant trend could be found across (TA)_(b) genotypes (P>0.1, JT test) (FIG. 2d). In the Caucasian sample, pairwise comparisons of the phenotype between two genotype groups showed significant differences between the 6/7 and 7/7 (P=0.007, one-sided exact Wilcoxon test) and 6/6 and 7/7 groups (P=0.0002). No pair-wise comparison was significant within African-Americans, probably due to small number of samples of each genotype.

[0197] When (TA)_(n) genotypes were regarded as the sum of TA repeat number in both chromosomes (i.e. ≦12 (5/6, 6/6, 5/7), 13 (6/7) and ≧14 (7/7, 6/8, 7/8) genotypes), a significant trend of reduced UGT1A1 activity (P<0.01) was measured across the three groups (the lowest being the >14 genotype group) in the whole sample population, in Caucasians, in African-Americans but not in samples with Asian/other/unknown ethnicity (P=0.66). Pairwise comparisons (one-sided exact Wilcoxon test) showed significantly reduced UGT1A1 activity (P<0.01) in ≧14 compared to 13 and ≦12 genotypes, and in 13 compared to ≦12 genotypes in the whole sample population and in Caucasians. In African-Americans≦12 genotypes had significantly higher UGT1A1 activity compared to either 13 or ≧14 genotypes (P=0.028 and 0.016, respectively), but UGT1A1 activity was not significantly different between 13 and ≧14 genotypes (P=0.11).

[0198] In samples of Caucasian and African origin, SN-38 glucuronidation rate varies significantly across the haplotypes with a decreasing trend (P<0.0001, JT test) (FIG. 3). However, this apparent haplotype-phenotype correlation is likely to be due to the effect of the (TA)_(n) polymorphism that is in linkage disequilibrium with the PBREM variants. Hence, the possible functional effects of the common −3279G>T and −3156G>A variants were investigated by comparing the SN-38 glucuronidation rates across genotypes that differed only by the variant being examined. Concerning the −3279G>T variant, SN-38 glucuronidation was reduced in I/II pairs compared to I/I pairs among Caucasians, although without reaching statistical significance (2.06±0.74 versus 2.53±0.82 SN-38G/IS, respectively) (Wilcoxon rank sum test, P=0.18). Concerning the −3156G>A variant, although SN-38 glucuronidation is slightly reduced in I/VII compared to I/VI pairs, the difference is not statistically significant (Wilcoxon rank sum test, P=0.64).

Example 6 Materials and Methods for Example 7

[0199] Patient Selection

[0200] Patients with histologically confirmed solid tumors or lymphoma known to respond to irinotecan or for which no therapy of proven benefit exists were eligible to participate in this study. Other eligibility criteria included measurable disease by radiologic imaging or physical examination; age of at least 18 years; Karnofsky performance status of at least 70% (ambulatory and capable of self-care); and adequate organ function defined as absolute neutrophil count (ANC)≧1500 μl⁻¹, platelet count≧100,000 μl⁻¹, serum creatinine level≦1.5 mg/dl or creatinine clearance≧60 ml/min, AST and ALT levels<5 times the upper limit of normal, and conjugated bilirubin within normal limits. Patients must have been off previous anticancer therapy, including radiation therapy, for at least 4 weeks (6 weeks if the previous treatment included a nitrosourea or mitomycin C) and off colony stimulating factor for at least 2 weeks. Patients with a history of inflammatory bowel disease requiring therapy, chronic diarrheal syndrome, paralytic ileus, or organ or stem cell transplant were excluded from the study. Concurrent use of medications that may be substrates of the UGT1A1 enzyme or that may be inducers or inhibitors of UGT1A1 activity was not permitted. Pregnant and lactating women were also excluded from participation, and those with reproductive potential were required to use an effective contraceptive method if sexually active.

[0201] Treatment Protocol

[0202] Irinotecan was supplied by the National Cancer Institute (NCI) as an intravenous solution with concentration 20 mg/ml in either 2 ml or 5 ml vials. The amount of irinotecan to be administered was removed aseptically from the vial and added to 500 ml of 0.9% saline or 5% dextrose injection, USP. Thirty minutes after pretreatment with 20 mg intravenous ondansetron, irinotecan 350 mg/m² was administered as a 90 minute intravenous infusion once every 3 weeks—a standard dose and schedule. History, physical examination, complete blood count (CBC) with differential, serum chemistry profile (electrolytes, blood urea nitrogen, creatinine, glucose, albumin, alkaline phosphatase, GGTP, AST, ALT, total and conjugated bilirubin, uric acid, and lactate dehydrogenase), and coagulation profile (prothrombin time and partial thromboplastin time) were conducted prior to first treatment. Thereafter, history, physical examination, and toxicity assessment were conducted on day 1 of each cycle unless treatment-related toxicity required more frequent follow up. CBC and serum chemistry profile were obtained weekly throughout treatment, though CBCs were obtained 3 times per week with the appearance of grade 3 or 4 neutropenia or thrombocytopenia. Toxicity assessment was done according to the NCI common toxicity criteria, version 2.0 (website: ctep.cancer.gov). Objective tumor assessment by appropriate radiographic imaging was performed prior to starting therapy and after every 2 cycles.

[0203] Toxicity Management and Dose Modification

[0204] For patients who experienced diarrhea, abdominal pain, or diaphoresis within 24 hours of irinotecan administration, 0.25 mg to 1 mg of intravenous atropine was considered. Delayed diarrhea, defined as diarrhea occurring more than 24 hours after irinotecan administration, was treated promptly with loperamide 4 mg at the onset and then with 2 mg every 2 hours until the patient was diarrhea-free for at least 12 hours. For patients who failed loperamide therapy, diphenoxylate, octreotide, and tincture of opium were sequentially added as needed. Patients were instructed to aggressively hydrate orally and were admitted to the hospital for intravenous electrolyte and fluid replacement when necessary. A new course of therapy was not started until the ANC recovered to at least 1500 μl⁻¹, the platelet count recovered to at least 100,000 μl⁻¹, and treatment-related diarrhea fully resolved. Patients with grade 3 or 4 toxicities of any kind were dose-reduced by 50 mg/m² for subsequent cycles.

[0205] Sample Collection

[0206] Prior to the first irinotecan infusion, venous blood (4.5 ml) for genotyping was collected in purple top Vacutainer® tubes containing EDTA (Becton, Dickinson, and Company, Franklin Lakes, N.J.) and stored at −80° C. for no more than 5 days prior to analysis. Venous blood for pharmacokinetic analysis was collected on day 1 of cycle 1 for pharmacokinetic analysis. Samples of 7 ml were collected into green top sodium heparinized Vacutainer® tubes prior to the infusion; 30, 60, and 90 minutes during the infusion; and 10, 20, 30, 45, and 60 minutes and 1.5, 2, 4, 6, 12, and 24 hours after the infusion. Samples were centrifuged (2500 rpm, 20 min, 4° C.) and the plasma was immediately separated, transferred as two aliquots into storage tubes, frozen at −80° C. until analysis.

[0207] UGT1A1 Genotyping Assays

[0208] The variants typed in this study are listed in Table 4. The UGT1A1 (TA)nTAA polymorphism was genotyped by PCR and product sizing as previously described (Te et al., 2000). Alleles with 6 TA repeats resulted in a 98 bp fragment while alleles with 7 TA repeats resulted in a 100 bp fragment. Alleles with 5 TA and 8 TA repeats resulted in 96 bp and 102 bp fragments respectively. Alleles with 5, 6, 7, and 8 TA repeats are reported as (TA)_(n) and genotypes are assigned based upon the number of TA repeats in each allele, i.e., 6/6, 6/7, 7/7, 6/8, et cetera.

[0209] The variants in the 5′upstream region (−3279G>T and −3156G>A) and in exon 1 [211G>A (G71R) and 686C>A (P229Q)] were genotyped by single base extension (SBE) and separated on a denaturing high performance liquid chromatography (DHPLC) system (Devaney et al., 2001). Genotyping of the −3279G>T and −3156G>A variants was performed by PCR amplification of a 333 bp fragment in the UGT1A1 5′ upstream region that contains both variants. The PCR primers used were: 5′-ACC TCT AGT TAC ATA ACC TGA A-3′ (forward primer; SEQ ID NO:6) and 5′-AAT AAA CCC GAC CTC ACC AC-3′ (reverse primer; SEQ ID NO:7). PCRs were performed in a 15 μl volume containing 125 nM each primer, 2.5 mM MgCl₂, 50 μM each dNTP and 0.375 U of AmpliTaq Gold polymerase (Applied Biosystems) in the buffer provided by the manufacturer. PCR cycling conditions were for 40 cycles at 95° C. for 15 s, 58° C. for 15 s and 72° C. for 30 s in a 9600 thermal cycler (Applied Biosystems). PCR amplified products were purified using shrimp alkaline phosphatase and exonuclease I by incubating at 37° C. for 45 min prior to the SBE reaction. SBE reactions were performed in duplex for genotyping of both variants in 10 μl volumes containing 1 μM of extension primer (5′-GCC AAG GGT AGA GTT CAG T-3′ (SEQ ID NO:8) for −3279G>T and 5′-GAC CCC AGC CCA CCT GTC-3′ (SEQ ID NO:9) for −3156G>A), 250 μM each ddNTP and 1.25 U thermosequenase (Amersham Pharmacia Biotech). Reactions were cycled at 96° C. for 30 s, 55° C. for 30 s and 60° C. for 30 s for 60 cycles. Separation of the SBE products was performed on a WAVE 3500HT DHPLC system (Transgenomic Inc) at 70° C. after denaturation of the samples. The flow rate used was 1.5 ml/min and the run time for each sample was 2.5 min. The gradient used for elution of the SBE products was created by the software based on the length of the extended product and was adjusted from 24% to 34% buffer B over 2 min (buffer B contains 25% acetonitrile). Extended products were eluted in the order of C<G<T<A which is dependent on the hydrophobicity differences of the four bases.

[0210] Genotyping of the 211 G>A and 686C>A exon 1 variants was performed by PCR amplification of a 774 bp fragment that encompasses both variants. The PCR primers used were: 5′-ATG CTG GGA AGA TAC TGT TG-3′ (forward primer; SEQ ID NO:10) and 5′-TTT GGT GAA GGC AGT TGA TT-3′ (reverse primer; SEQ ID NO:11). PCRs were performed in a 15 μl volume containing 125 nM each primer, 2.5 mM MgCl₂, 100 μM each dNTP and 0.375 U of AmpliTaq Gold polymerase (Applied Biosystems) in the buffer provided by the manufacturer. PCR cycling conditions were for 40 cycles at 95° C. for 15 s, 55° C. for 15 s and 72° C. for 45 s in a 9600 thermal cycler (Applied Biosystems). PCR purification was performed as described above and the SBE reactions were performed in 10 ul volumes containing 1 μM of each extension primer (5′-GTC TTC AAG GTG TAA AAT GCT C-3′ (SEQ ID NO:12) for 211G>A or the 5′-GTG CGA CGT GGT TTA TTC CC-3′ (SEQ ID NO:13) for 686C>A) using the conditions described above. For separation on the DHPLC system, a flow rate of 1.5 ml/min and a run time of 3 min was used for each sample. The gradient used for elution of the SBE products was created by the WAVE software based on the length of the extended product and was adjusted from 25.6% to 38.1% buffer B over 2.5 min.

[0211] Pharmacokinetic Analysis

[0212] Plasma concentrations of irinotecan and its metabolites were determined as previously published (Iyer et al., 2001). Pharmacokinetic parameters for irinotecan, SN-38, and SN-38G were calculated using standard non-compartmental methods with WinNonlin 2.0 (Pharsight Corporation, Mountain View, Calif.). The area under the plasma concentration-time curve (AUC) from time zero to the last measured concentration of irinotecan and metabolites was determined by the linear trapezoidal method. The glucuronidation ratio was expressed as the ratio of the SN-38G AUC over SN-38 AUC.

[0213] Statistical Analysis

[0214] The study was originally designed to prospectively investigate the relationship between genetic variation in the UGT1A1 promoter and grade 3-4 diarrhea. Results from clinical trials using the 350 mg/m2 every 3 weeks schedule suggested a 20 to 35% frequency of diarrhea (ref). Based on previously published data, a single-gene Mendelian model implied that 16% of patients would have the 7/7 genotype, 48% would have the 6/7 genotype, and 36% would have the 6/6 genotype. A sample size of 60 would have had power of 0.8 at α=0.05 to detect a linear trend in the proportion of patients within each genotype experiencing grade 3-4 diarrhea defined by 60% of 7/7 patients, 30% of 6/7 patients, and 10% of 6/6 patients.

[0215] However, due to lower than expected frequency of grade 3-4 diarrhea (see below), the analyses were instead focused on the frequency of grade 4 neutropenia (ANC<500 μl⁻¹). Nonparametric trend tests were used to investigate how the genotype is related to pharmacokinetic parameters, pretreatment bilirubin levels and ANC nadir. The relationship between genotype and grade 4 neutropenia was assessed by the use of Fisher's exact test and calculation of the relative risks. Univariate regression analyses were performed to identify the potential predictors of ANC nadir. They were performed on the log scale for ANC to reduce skewness in the residuals. The pretreatment variables were also considered jointly via analysis of covariance (ANCOVA) models in order to identify the pretreatment measurements that can predict ln(ANC nadir). A different ANCOVA model simultaneously considering the pre- and post-treatment variables was used to explore the mechanism through which variability in UGT1A1 status might affect the ANC nadir.

Example 7 Role of −3516G>A of UGT1A1 in Irinotecan Toxicity

[0216] Patient Characteristics

[0217] Sixty-six patients were enrolled in the study (Table 5). Blood was mistakenly not drawn for DNA extraction in one patient and genotype information is available in 65 patients. Sixty-three patients were assessable for toxicity as 3 patients (one 6/6, one 6/7, one 7/8) missed scheduled blood tests and/or physician appointments. Sixty patients are assessable for tumor response, as 6 of them were removed from the study before radiological assessment of tumor response. All the patients received prior chemotherapy regimens. Thirty-five of them received additional prior radiotherapy.

[0218] Allele and Genotype Frequencies

[0219] The TA indel allele frequencies were: TA₆=0.68, TA₇=0.29, TA₈=0.02, TA₅=0.01. The TA₅ and TA₈ alleles occurred exclusively in Black patients (one with 5/6, two patients with 6/8, and one patient with 7/8 genotype). −3279T and −3156A alleles had a frequency of 0.55 and 0.26, respectively.

[0220] Table 6 shows the frequencies of promoter haplotypes comprising −3279, −3156, and the TA indel, based upon our previous publication on their linkage disequilibrium (Innocenti et al., 2002). The frequency of the haplotype pairs is shown in Table 7. No exon 1 variants (211G>A and 686C>A) were detected in this patient population.

[0221] Toxicity Prevalence, Relative Risk, Genetic Test.

[0222] Toxicity of diarrhea and neutropenia refer to events observed during cycle 1 of treatment. The frequency of grade 4 neutropenia was 9.5%. Grade 4 neutropenia was much more common in patients with genotype 7/7 (3/6, 50%) compared to patients with 6/7 genotype (3/24, 12.5%) and 6/6 genotype (0/30, 0%) (p=0.001, Fisher's exact test). Nonparametric trend analysis revealed that the TA indel polymorphism is significantly correlated to ln(ANC nadir) (7/7<6/7<6/6, z=−2.35, p=0.02) (FIG. 4).

[0223] Because the −3156G>A variant distinguishes between two different haplotypes in the TA₇ individuals, the relative risk of grade 4 neutropenia was analyzed for the −3156 AA genotype (versus AG and GG combined) and 7/7 genotype (versus 5/6, 6/6, 6/7 and 6/8 combined). A higher relative risk was found in patients with −3156 AA genotype (14.0, 95% CI 2.1-36.7) compared to patients with 7/7 genotype (9.3, 95% CI 1.7-40.7, n=63). Moreover, the predictive power of a genetic test in patients receiving irinotecan was evaluated for both the TA indel and the −3156 variant (Table 8). The predictive power of either 7/7 or −3156 AA genotypes for grade 4 neutropenia was evaluated. In addition, the predictive power of either 6/6 or −3156 GG genotypes was evaluated in relation to the absence of grade 4 neutropenia (i.e., grade 0-3). In this comparison, the two 6/8 patients were regarded as either 6/6 or 6/7 genotypes in order to assess whether patients with the TA₈ allele might be a confounding factor for the results of the genetic test.

[0224] While this study was originally conceived to examine the relationship between UGT1A1 genotype and severity of diarrhea, the frequency of grade 3 diarrhea in our patients was only 5% (n=3), with no instances of grade 4 diarrhea. None of the three patients with grade 3 diarrhea were 6/6 (2 6/7 and one 7/7 genotypes). Concerning the diarrhea events in patients with the TA₈ allele (two 6/8 and one 7/8), only a grade 1 event was reported in one 6/8 patient. The low frequency of severe diarrhea did not allow any formal statistical analysis.

[0225] Total Bilirubin: Correlation with TA Indel Genotype and Toxicity

[0226] Pretreatment total bilirubin levels were obtained in all patients (0.5±0.22 mg/dl, mean±SD, n=66). As is shown in FIG. 5, total bilirubin levels were significantly correlated with the TA indel polymorphism (nonparametric trend analysis, 7/7>6/7>6/6, z=2.88, p<0.01). Total bilirubin levels were significantly higher in 7/7 patients compared to 6/6 and 6/7 patients combined (0.80±0.29 and 0.48±0.19 mg/dl, respectively, p=0.0003). Concerning the distribution of the −3156 genotypes withing each TA indel genotype group, in the 6/7 genotype group, the three patients with GG genotype had low bilirubin levels of 0.3-0.4 mg/dl. Similarly, the two patients with 6/8 and GG genotypes had low levels of bilirubin of 0.2-0.3 mg/dl. The one patient with GA genotype in the 7/7 group has a bilirubin level of 0.6 mg/dl, which is in the low range for this genotype group. The 7/8 patient did not have markedly elevated levels of total bilirubin as would be expected if the TA₈ allele resulted in decreased glucuronidation.

[0227] In addition, the −3156 and the TA indel variants were correlated with total bilirubin by multiple regression analysis. The AA genotype showed a slightly better correlation (r²=0.28, p<0.0001) compared to 7/7 genotype, either when the TA₈ alleles were regarded as TA₆ (r²=0.23, p=0.002) or TA₇ (r²=0.20, p=0.0009). The other common variant −3279G>T had no significant association with total bilirubin.

[0228] Whether pre-treatment bilirubin would correlate with neutropenia was also analyzed. Significantly higher bilirubin levels were observed in patients with grade 4 neutropenia (0.83±0.21 mg/dl) compared to those without grade 4 neutropenia (0.47±0.20 mg/dl) (p=0.0001) (FIG. 6). No cases of grade 4 neutropenia were reported in patients with bilirubin levels less than 0.6 mg/dl. Out of the 7 patients with total bilirubin higher then 0.7 mg/dl, 4 of them had grade 4 neutropenia.

[0229] Correlation Between TA Indel Genotype and PK Parameters

[0230] Table 9 describes the pharmacokinetic parameters of irinotecan and its metabolites stratified by 6/6, 6/7, and 7/7 genotypes. SN-38 AUC increases while increasing the number of TA₇ alleles (nonparametric trend analysis, 7/7>6/7>6/6, z=2.13, p=0.03). Conversely, glucuronidation ratios (SN-38G/SN-38 AUC ratios) were reduced while increasing the number of TA₇ alleles, (nonparametric trend analysis, 6/6>6/7>7/7, z=−2.16, p=0.03). No significant trend was found for irinotecan and SN-38G AUCs (p>0.05).

[0231] Regression Analysis

[0232] The impact of both pharmacokinetic variability and pre-treatment (including genotype) variables on variability in neutropenia was also examined. Instead of the TA indel genotype, the −3156 variant was used because 1) the −3156 genotype was better correlated with the risk of grade 4 neutropenia and 2)-3156 better reflected the UGT1A1 status of patients, based upon the data on the correlation with total bilirubin. Univariate regression analyses of ANC nadir selected SN-38 AUC, total bilirubin and −3156 genotype as the three best independent variables (Table 10). Gender showed a non-significant correlation with ANC nadir but it was included in further modeling because of possible gender differences in glucuronidation. Other variables did not show any correlation.

[0233] Multivariate Analyses

[0234] Several multivariate predictive ANCOVA models were considered to identify the pretreatment measurements that predict ln(ANC nadir). The final model (r²=0.41) was selected by backward elimination from Table 10 and is presented Table 11. Pretreatment bilirubin level is found to be very significant and negatively related to ln(ANC nadir). Gender and −3156 genotype are found to be marginally significant after adjusting for the total bilirubin level. Ln(ANC nadir) is found to have a lower value in women, and it decreases with increasing number of (TA)₇ alleles (6/6>6/7>7/7). Other factors, such as ethnicity, number of prior regimens, performance status, and ln(pretreatment ANC) are not found to be significant predictors of ln(ANC nadir) after adjusting for −3156 genotype, gender and total bilirubin.

[0235] After determining the predictive model using pre-treatment variables, the post-treatment measurements of irinotecan AUC, SN-38 AUC, SN-38G AUC, and glucuronidation ratio were added to the model as independent variables with the intention of determining the possible mechanism of how the variability in UGT1A1 status affects ln(ANC nadir). The final model selected through backward elimination (r²=0.5141) which best predicts ln(ANC nadir) includes genotype and SN-38 AUC (p<0.001) (Table 12).

[0236] Toxic Death and Response

[0237] One toxic death was reported, as the patient died of neutropenia-related sepsis. He was admitted to the hospital on day 7 of cycle 1 with fever and an no neutrophils detected (white blood cell count of 100 μl). He was empirically treated with ceftazadime, tobramycin, and fluconazole, though no infectious source was ever identified. Despite support with granulocyte colony stimulating factor, the patient remained neutropenic, became septic, and died on day 11. He had 7/7 genotype and the highest level of pretreatment total bilirubin observed in these patients (1.2 mg/dl).

[0238] Concerning the response rates in this trial, three objective responses were observed. Two patients achieved a partial response (one with colorectal and the other with head and neck cancer) and had a 6/7 genotype. One colorectal cancer patient achieved a complete response and had a 6/6 genotype. TABLE 4 UGT1A1 variants typed in this study. Positions indicated are from the first base of the UGT1A1 start site in the UGT1A cluster reference sequence (AF297093). Nucleotide Amino acid change change Exon −3156G > A — Promoter −3279G > T — Promoter TA indel — Promoter 211G > A G71R 1 686C > A P229Q 1

[0239] TABLE 5 Patient characteristics No. of patients Patients Entered 66 Assessable for toxicity 63 Assessable for response 60 Sex Male 39 Female 27 Age, median (range) 60 (34-85) Ethnicity White 50 Black 10 Hispanic 4 Pacific Islander 1 Asian 1 Performance Status 100% 18  90% 31  80% 10  70% 17 Tumor type Colorectal 10 Gastroesophageal 14 Head and Neck 5 Liver 2 Lung 19 Pancreas 3 Unknown Primary 4 Others 9 Prior Radiotherapy 35

[0240] TABLE 6 Frequency of UGT1A1 promoter haplotypes. −3279 G > T −3156G > A TA indel Frequency T G 6 0.55 G G 6 0.13 G A 7 0.25 G G 7 0.03 G G 8 0.02 G G 5 0.01

[0241] TABLE 7 Frequency of haplotype pairs Haplotype pairs Frequency TG6/TG6 0.28 TG6/GA7 0.28 TG6/GG6 0.18 GA7/GA7 0.08 GG6/TA7 0.06 TG6/GG7 0.05 TG6/GG8 0.02 GG6/GG8 0.02 GG5/TG6 0.02 GA7/GG8 0.02

[0242] TABLE 8 Genetic tests for the TA indel and −3156 genotypes Sensitivity Specificity PPV NPV 7/7, grade 4 0.50 (0.19-0.81) 0.95 (0.85-0.98) 0.50 (0.19-0.81) 0.95 (0.85-0.98) −3156 AA, grade 4 0.50 (0.19-0.8 1) 0.96 (0.88-0.99) 0.60 (0.23-0.92) 0.95 (0.86-0.98) 6/6, grade 0-3, 6/8 = 6/6 0.57 (0.44-0.69) 1.00 (0.61-1.00) 1.00 (0.89-1.00) 0.20 (0.10-0.37) 6/6, grade 0-3, 6/8 = 6/7 0.54 (0.41-0.66) 1.00 (0.61-1.00) 1.00 (0.89-1.00) 0.19 (0.09-0.35) −3156 GG, grade 0-3 0.63 (0.49-0.74) 1.00 (0.61-1.00) 1.00 (0.90-1.00) 0.22 (0.11-0.41)

[0243] TABLE 9 Pharmacokinetic parameters and by 6/6, 6/7, and 7/7 TA indel genotypes Irinotecan SN-38^(a) SN-38^(b) TA indel No. of AUC AUC AUC Glucuronidation Ratio^(b) genotype patients (ng * h/ml) (ng * h/ml) (ng * h/ml) (SN-38G AUC/SN-38 AUC) 6/6 30 24412.8 335.9 1954.2 6.52 (7691.6) (167.7) (1361.1) (3.98) 6/7 25 26085.5 458.4 1887.9 5.55 (10814.2) (379.8) (1682.5) (4.79) 7/7  6 25432.9 542.0 1819.1 3.59 (6694.9)  (195.3) (1249.8) (2.81)

[0244] TABLE 10 Univariate analysis of ln(ANC nadir) Independent Variable r² P SN-38 AUC 0.3523 <0.0001 Pre-treatment total bilirubin 0.2979 <0.0001 −3156 genotype 0.2413 0.0003 Irinotecan AUC 0.1273 0.0041 Glucuronidation ratio 0.1171 0.0060 Gender 0.0445 0.0971 SN-38G AUC 0.0411 0.1109 Age ≧ 70 0.0242 0.2231 White ethnicity 0.0128 0.3764 Ln(pre-treatment ANC) 0.0000 0.9749 Performance status 0.0016 0.9923

[0245] TABLE 11 ANCOVA for the final predictive model of ln(ANC nadir) using pre-treatment variables Coefficient SE p-value Intercept 8.1885 0.2767 <0.001 Genotype AA vs. GG + GA −0.9401 0.3986 0.022 Gender Males vs. Females 0.4323 0.2001 0.035 Total Bilirubin −1.8452 0.4816 <0.001

[0246] TABLE 12 ANCOVA for the final predictive model of ln(ANC nadir) using pre-treatment and post-treatment variables Coefficient SE p-value Intercept 8.3111 0.1517 <0.001 Genotype AA vs. GG + GA −1.3798 0.3234 <0.001 SN-38 AUC −0.0019 0.0003 <0.001

[0247] All of the compositions and/or methods disclosed and claimed herein can be made and executed without undue experimentation in light of the present disclosure. While the compositions and methods of this invention have been described in terms of preferred embodiments, it will be apparent to those of skill in the art that variations may be applied to the compositions and/or methods and in the steps or in the sequence of steps of the method described herein without departing from the concept, spirit and scope of the invention. More specifically, it will be apparent that certain agents that are both chemically and physiologically related may be substituted for the agents described herein while the same or similar results would be achieved. All such similar substitutes and modifications apparent to those skilled in the art are deemed to be within the spirit, scope and concept of the invention as defined by the appended claims.

REFERENCES

[0248] The following references, to the extent that they provide exemplary procedural or other details supplementary to those set forth herein, are specifically incorporated herein by reference.

[0249] U.S. Pat. No. 4,582,788

[0250] U.S. Pat. No. 4,656,127

[0251] U.S. Pat. No. 4,659,774

[0252] U.S. Pat. No. 4,682,195

[0253] U.S. Pat. No. 4,683,194

[0254] U.S. Pat. No. 4,683,195

[0255] U.S. Pat. No. 4,683,202

[0256] U.S. Pat. No. 4,800,159

[0257] U.S. Pat. No. 4,816,571

[0258] U.S. Pat. No. 4,883,750

[0259] U.S. Pat. No. 4,946,773

[0260] U.S. Pat. No. 4,959,463

[0261] U.S. Pat. No. 5,141,813

[0262] U.S. Pat. No. 5,264,566

[0263] U.S. Pat. No. 5,279,721

[0264] U.S. Pat. No. 5,428,148

[0265] U.S. Pat. No. 5,554,744

[0266] U.S. Pat. No. 5,574,146

[0267] U.S. Pat. No. 5,602,244

[0268] U.S. Pat. No. 5,645,897

[0269] U.S. Pat. No. 5,705,629

[0270] U.S. Pat. No. 5,786,344

[0271] U.S. Pat. No. 5,840,873

[0272] U.S. Pat. No. 5,843,640

[0273] U.S. Pat. No. 5,843,650

[0274] U.S. Pat. No. 5,843,651

[0275] U.S. Pat. No. 5,843,663

[0276] U.S. Pat. No. 5,846,708

[0277] U.S. Pat. No. 5,846,709

[0278] U.S. Pat. No. 5,846,717

[0279] U.S. Pat. No. 5,846,726

[0280] U.S. Pat. No. 5,846,729

[0281] U.S. Pat. No. 5,846,783

[0282] U.S. Pat. No. 5,849,481

[0283] U.S. Pat. No. 5,849,483

[0284] U.S. Pat. No. 5,849,486

[0285] U.S. Pat. No. 5,849,487

[0286] U.S. Pat. No. 5,849,497

[0287] U.S. Pat. No. 5,849,546

[0288] U.S. Pat. No. 5,849,547

[0289] U.S. Pat. No. 5,851,770

[0290] U.S. Pat. No. 5,851,772

[0291] U.S. Pat. No. 5,853,990

[0292] U.S. Pat. No. 5,853,992

[0293] U.S. Pat. No. 5,853,993

[0294] U.S. Pat. No. 5,856,092

[0295] U.S. Pat. No. 5,858,652

[0296] U.S. Pat. No. 5,861,244

[0297] U.S. Pat. No. 5,863,732

[0298] U.S. Pat. No. 5,863,753

[0299] U.S. Pat. No. 5,866,331

[0300] U.S. Pat. No. 5,866,337

[0301] U.S. Pat. No. 5,866,366

[0302] U.S. Pat. No. 5,900,481

[0303] U.S. Pat. No. 5,905,024

[0304] U.S. Pat. No. 5,910,407

[0305] U.S. Pat. No. 5,912,124

[0306] U.S. Pat. No. 5,912,145

[0307] U.S. Pat. No. 5,912,148

[0308] U.S. Pat. No. 5,916,776

[0309] U.S. Pat. No. 5,916,779

[0310] U.S. Pat. No. 5,919,626

[0311] U.S. Pat. No. 5,919,630

[0312] U.S. Pat. No. 5,922,574

[0313] U.S. Pat. No. 5,925,517

[0314] U.S. Pat. No. 5,925,525

[0315] U.S. Pat. No. 5,928,862

[0316] U.S. Pat. No. 5,928,869

[0317] U.S. Pat. No. 5,928,870

[0318] U.S. Pat. No. 5,928,905

[0319] U.S. Pat. No. 5,928,906

[0320] U.S. Pat. No. 5,929,227

[0321] U.S. Pat. No. 5,932,413

[0322] U.S. Pat. No. 5,932,451

[0323] U.S. Pat. No. 5,935,791

[0324] U.S. Pat. No. 5,935,825

[0325] U.S. Pat. No. 5,939,291

[0326] U.S. Pat. No. 5,942,391

[0327] U.S. Pat. No. 5,952,174

[0328] U.S. Pat. No. 6,287,834

[0329] U.S. Pat. No. 6,319,678

[0330] U.S. Pat. No. 6,395,481

[0331] U.S. Pat. No. 6,407,117

[0332] U.S. Pat. No. 6,472,157

[0333] U.S. Pat. No. 6,479,236

[0334] European Appl. 201,184

[0335] European Appl. 237,362

[0336] European Appl. 258,017

[0337] European Appl. 266,032

[0338] European Appl. 320,308

[0339] European Appl. 329,822

[0340] European Appl. 50,424

[0341] European Appl. 84,796

[0342] French Appl. 2,650,840

[0343] Great Britain Appl. 2 202,328

[0344] PCT Appl. PCT/US87/00880

[0345] PCT Appl. PCT/US89/01025

[0346] PCT Appl. WO 88/10315

[0347] PCT Appl. WO 89/06700

[0348] PCT Appl. WO 91/02087

[0349] PCT Appl. WO 92/15712

[0350] PCT Appl. WO 96/01127

[0351] Ando et al., Cancer Res., 60(24):6921-6926, 2000.

[0352] Araki et al., Jpn. J Cancer Res., 84:697-702, 1993.

[0353] Ausubel et al., In: Current Protocols in Molecular Biology, Green Pub. Assoc., Inc., and John Wiley & Sons, Inc., NY, (I):2.10.3, 1989.

[0354] Bellus, J. Macromol. Sci. Pure Appl. Chem., A31(1): 1355-1376, 1994.

[0355] Beutler et al., Proc. Natl. Acad. Sci. USA, 95(14):8170-8174, 1998.

[0356] Bosma et al., Hepatology, 15:941-947, 1992.

[0357] Bosma et al., N. Eng. J. Med., 333:1171-1175, 1995.

[0358] Burchell et al., DNA Cell Biol., 10:487-494, 1991.

[0359] Clarke et al., Handbook of Experim. Pharm., 112:3-43, 1994.

[0360] Cunningham et al., Lancet., 352(9138):1413-1418, 1998.

[0361] Devaney et al., Anal. Chem., 73:620-624, 2001.

[0362] Froehler et al., Nucleic Acids Res., 14(13):5399-5407, 1986.

[0363] Frohman, In: PCR Protocols: A Guide To Methods And Applications, Academic Press, NY, 1990.

[0364] Fuchs et al., J. Clin. Oncol., 21(5):807-814, 1993.

[0365] Ghosh and Bachhawat, In: Liver Diseases, Targeted Diagnosis and Therapy Using Specific Receptors and Ligands. Wu et al., eds., Marcel Dekker, NY, 87-104, 1991.

[0366] Gibbons and Charkraborti, In: Nonparametric statistical inference, 3^(rd) Ed., NY, Marcel Dekker, Inc., 1992.

[0367] Guillamette et al., Cancer Res., 60:950-956, 2000.

[0368] Gunn J. Hered, 29:137-139, 1938.

[0369] Gupta et al., Cancer Res., 54:3723-3725, 1994.

[0370] Gupta et al., J. Clin. Oncol., 15:1502-1510, 1997.

[0371] Halushka et al., Nat. Genet., 22(3):239-247,1999.

[0372] Inazuka et al., Genome Res, 7(11):1094-1103, 1997.

[0373] Innis et al., Proc Natl Acad Sci USA. 85(24):9436-9440, 1988.

[0374] Innocenti et al., Drug Metab. Dispos., 29:686-692, 2001.

[0375] Innocenti et al., Pharmacogenetics 12(9):725-33, 2002.

[0376] Innocenti et al., Pharmacogenetics 13(3):183, 2003.

[0377] Iyer et al., Phamacogenomics J., 2:43-47, 2002.

[0378] Iyer et al., J. Clin. Invest., 101:847-854, 1998.

[0379] Johnson et al., Nat. Genet., 29(2):233-237, 2001.

[0380] Kaneda et al., Cancer Res., 50:1715-1720, 1990.

[0381] Ke and Cardon, Bioinformatics, 19(2):287-288, 2003.

[0382] Komher, et al., Nucl. Acids. Res. 17:7779-7784, 1989.

[0383] Kuppuswamy, et al., Proc. Natl. Acad. Sci. USA, 88:1143-1147,1991.

[0384] Kwoh et al., Proc. Nat. Acad. Sci. USA, 86:1173, 1989.

[0385] Kwok et al., Genomics, 23(1):138-144, 1994.

[0386] Kwok, Annu Rev Genomics Hum Genet., 2:235-58, 2001.

[0387] Kwok and Chen, Curr Issues Mol. Biol., Apr;5(2):43-60, 2003.

[0388] Landegren, et al., Science 241:1077-1080, 1988.

[0389] Maxam, et al., Proc. Natl. Acad. Sci. (U.S.A) 74:560, 1977.

[0390] Monaghan et al., Lancet, 347:578-581, 1996.

[0391] Mullis et al., Cold Spring Harbor Symp. Quant. Biol. 51:263-273, 1986.

[0392] Nei, In: Molecular evolutionary genetics, NY, Colyumbia University Press, 1987.

[0393] Nickerson et al., Nucleic Acids Res., 25:2745-2751, 1997.

[0394] Nickerson et al., Proc. Natl. Acad. Sci. USA, 87:8923-8927,1990.

[0395] Nyren et al., Anal. Biochem. 208:171-175, 1993.

[0396] Ohara et al., Proc. Natl. Acad. Sci. USA, 86:5673-5677, 1989.

[0397] Owens and Ritter, Prog. Nucleic Acid Res. Molec. Biol., 51:305-338, 1995.

[0398] Prezant et al., Hum. Mutat. 1:159-164, 1992.

[0399] Purba et al., Br. J. Clin. Pharmacol., 23:447-453, 1987.

[0400] Ramirez et al., Pharm. Res., 19:588-594, 2002.

[0401] Ratain et al., J. Clin. Oncol., 20(1):7-8, 2002.

[0402] Remington's Pharmaceutical Science”, 15^(th) Ed., pg. 1035-1038 and 1570-1580.

[0403] Ritter et al., J. Biol. Chem., 267:3257-3261, 1992.

[0404] Rothenberg et al., J. Clin. Oncol., 11(11):2194-21204, 1993.

[0405] Rothenberg et al., J. Clin. Oncol., 19(18):3801-3807, 2001.

[0406] Rougier et al., Lancet., 351(9117):1677-1681, 1998.

[0407] Rozas and Rozas, Bioinformatics; 15:174-175, 1999.

[0408] Rozen and Skaletsky, www.genome.wi.mit.edu/genome software/other/primertml; 1998.

[0409] Saltz et al., N. Engl. J. Med., 343(13):905-914, 2000.

[0410] Sambrook et al., In: Molecular cloning, Cold Spring Harbor Laboratory Press, Cold Spring Harbor, N.Y., 2001.

[0411] Sanger, et al., J. Molec. Biol., 94:441, 1975.

[0412] Schneider et al., In: A software for population genetics analysis, Geneva, Switz.: Genetics and Biometry Lab., Univ. of Geneva, Switz, 2000.

[0413] Sokolov, Nucl. Acids Res. 18:3671, 1990.

[0414] Stephens et al., Am. J. Hum. Genet., 68:978-989, 2001.

[0415] Stocker, Science, 235:1043-1046, 1987.

[0416] Sugatani et al., Biochem. Biophys. Res. Commun., 292:492-497, 2002.

[0417] Sugatani et al., Hepatology, 33:1232-1238, 2001.

[0418] Syvanen et al., Genomics 8:684-692, 1990.

[0419] Taillon-Miller et al., Genome Res, 8(7):748-754, 1998.

[0420] Takasuna et al., Nippon Yakurigaku Zasshi 105(6):447-60, 1995.

[0421] Te et al., Transplantation, 69:1882, 2000.

[0422] Tucker, J. Pharm. Pharmacology, 46:417-424, 1994.

[0423] Tukey and Strassburg, Annu. Rev. Pharmacol. Toxicol., 40:581-616, 2000.

[0424] Ugozzoll et al., GATA 9:107-112, 1992.

[0425] Vanhoefer et al., J. Clin. Oncol., 19(5):1501-18, 2001.

[0426] Walker et al., Proc. Natl. Acad. Sci. USA, 89:392-396, 1992.

1 13 1 17483 DNA Homo sapiens 1 tcaccgcttc ctccctgtcc tcggggtttt tgtcggggtg ccacttgagc gccagcttgc 60 ggtacgcctt cttgataccc tcggacgagg cctaccgggg tactcccagc acctcgtagt 120 agtccactat gctggactgc caaagagcct gcggggcact ggcacagcga gcggcaaggc 180 tgccagcacc cgcgcacagg tcagaggctt ggcgacctgg gccgcctgga gggccgcccc 240 ttatgacgca gccacatctc attggccgag gcctgtgagc gcctcgcatc ccaagatgca 300 gtgctcctgg gactggccct gctctctgtg aggctctgtg aggccctgtg atgctccaag 360 accaggcccc gcccactccg gcctccaacc agccatggtc tccaaaaagg atgggaaaaa 420 gaggttgggg aaaagagagg gccttgactt tggctgcctg aagaactgtt tttcttaaag 480 taggctttat atcagtcttt ttcctcggcc acaggaggga agagggtggt gggagtgagt 540 ttagtctgac cggggctgaa gacatcctgt tgtttaggac tgcggttctc caacgttcca 600 gccccggtgc ccatttgctt ttgttcatct ggattatgcc tatcatatgt actgcattag 660 agattaaaac agaattaaaa agacatattc attgggcaat ttaagaagaa taaacccatg 720 acacactaac aaaccttttt atgtaacttt ttttgagaca aaatgtagtg agaagagtgg 780 catcgtttta cagtttttgc atctctctct ttagtacttg gctctataga gaggtggatt 840 ctcatgtcag cttctgcatt ctatctattg tgatattaca catcccccat gtagcttctg 900 gaaaactcca ctgtacactt gtgggagaat gacaatgaga aaatcaagta acattattac 960 ggaaatagtt ttgactttgt aaaattctcc tgaaaaatta ctggggatcc ctaggatttc 1020 ctggctcata ctttgagaat cgctagtcta gcagagtagt ccctggtatt ctgaagggat 1080 tagtttagga caaccctcct tccccatacc aaaatctaga tgctcaagct ccttttataa 1140 aatgacacag tatttgtata taacctaccc atatcctcct ttaaacctct agtcatctct 1200 tgattacttt tacctaataa atgtaaatgc tatgtaaata gttgttttac agtattggtt 1260 ttttatttgt attatttgta ctgttttttt ttcattgttg ttccccccaa atattttcaa 1320 tctgctgttg gctgaatctg cagatgtgaa gcccaagtat atggagggtc aaatgtgcat 1380 gttattcact tttcttgact gctaaaacaa ccagggagat cctctcagac aaaaggaaat 1440 acagcactat ttactgtatc gaaaccatta agacttgcag gccgtgtgta tagcactggg 1500 gataaacatg ggatgcagtg attatttccg ctagaactgc tatataatga cgatgaattt 1560 tggggggact ttttttgaga tctgagttct cttcacctcc tccttattct ctttttgaca 1620 ctggattctt tgctttgata aattgtgggg caatacacta gtaaaggtca ctcaattcca 1680 aggggaaaat gattaaccaa agaacattct aacggttcat aaagggtatt aggtgtaatg 1740 aggatgtgtt atctcaccag aacaaacttc tgagtttata taacctctag ttacataacc 1800 tgaaacccgg acttggcact tggtaagcac gcaatgaaca gtcatagtaa gctggccaag 1860 ggtagagttc agtttgaaca aagcaatttg agaacatcaa aggaagtttg gggaacagca 1920 agggatccag aatggctaga gggtaagagg cagagggagg gggcaagcag aagggctaga 1980 gaggaggaat gagcttggac aggtgggctg gggtctatcc cagagttttg agagcaaggc 2040 agaggactct gaattttctg tgcccaggaa gctgctgacc aaggttccag aagtggtggt 2100 gaggtggggt ttattcaggt gggcagccga tgcaatgatt caaaagggac agctgggggt 2160 tgggggacca gggggggctg gggccctgaa atgggaccat gacagctggg tctgagagac 2220 agtggtagaa acatccagat tcagcactta cttgctggct tggatgcagg gtctagaacg 2280 aaaagagaag aaaagtcact tctatacaga aacatgtcca gagcgcttac tgtctccaaa 2340 accatggact ggcacctgag tgatagcatg attccaaagc caaaatcttg cctgtaagga 2400 atatatatat atatatatat atatatgtat atatgatata gctatagtct aatagcaagg 2460 acagatatgc aaactgctaa aagatacaag gcagaacaga acaaaatgct gtttttctgg 2520 gattttggaa attcaaggaa ttcaagggat tcaaggaagg tggctttgct tcccgggagg 2580 gtcctgtaga tgatctacag ggcactggac atgtttatgt tgctccttta gtaataagcc 2640 tgtcattctg atttgatgaa aggagatgaa aggagctggt agtgtgtctg atggtggcct 2700 actaacttat gtcttcagct taaaaagaaa gtagcttcaa aagggttcca gaaacacttt 2760 ccatggacgt gtcactcttt agcagccccc aaagcaagac catcatattg ctgccctgct 2820 gtgtgatttc tcagccccta gagcaccatc ccctgtaatt gcctggtcat gagtttgtct 2880 ctgtctacct gacccctcct ttcaggcaag gaccatttct aacttgactt tctgggccta 2940 gttcctagca tagtgactgc catccagtag ggctcacacg ttccataaat atttggcaga 3000 tgagggaatt agcaatgggt tctgctttgg tttcagagca gatattaatt ggattgctta 3060 gtagtggttc tctgttgtaa ttcatgagca tgaatgtgga ttgcccacta ttcagattag 3120 taagtatttc ttggtcaagg gcagagctgt ggccacaaac catccaggta cacagcagaa 3180 gcagcctcaa aaagcttgga agctctgcat gatgcaggaa agtcataaaa tcattacagt 3240 ggtgacttat gtgtttatag cccctttact gtctataatc tgcaaatgaa ctcacacagc 3300 attgggactt tggaagaatt atcaccctta aggtttaaat taaactgtga atttcagaat 3360 ttctaataag gacacaacaa agagtgaaag cattgctatg tctattctgc ttgcccagaa 3420 tcttggtcct aaaaaatgaa gagtgtttgg gtgtggggag gagcttcagt gtgcatgtgc 3480 atgcaaagta cctactctaa ggagaagaat gagagggtac cctaattacc tgttaatatg 3540 tcccatagga caccaaaact ctagttagct gtttctctat gatcctctaa gcacatcccc 3600 aagtatggct ggccagtgat gtgtatggtt caaatgttgg gatctgtgca gttatcttgg 3660 aattgtatag tacagcagta tatccccccc aaaaagagtg taatacttcc aattctggct 3720 gcacaatact tgccccatag tccatggtca ataaatacaa atttgagttg tttttgctca 3780 tctttccctt ttgacttcaa atcagtcatc agaatttccc caaatgcctt tcccctggat 3840 cttgggccag tggaatgagt acaatttaac ttaattgaat ttgcttatct atttggtttc 3900 ctgttgtgaa caaaagttct ctgaaaagga atttggaaga aagagacttt gttctagtga 3960 acagtttgca aaccagggag ttacagcctc tggtacgcaa tgaaggtgag ttccacagaa 4020 cacaaggcag gcaggtttca cggcaaaaag ttccttccca ggttcccaat caggtccatt 4080 tatgcaaatg aaggatggaa acttgcttag ttcttattgg tcactgcagc tgcattctga 4140 ttggttgatg aagctgagcc ctgagtggct gaggtgggtg agctttaatt ggttggttca 4200 ggtgagcgct gaaaatctca actataaaaa ggtacaggtt ttcaggatac tcagagtaac 4260 cgtgtgacct gtagtaagca aagggccagt tggctctatt ttaaatccag gcccagttag 4320 ccactcaaga tctatcttac aggactggct ctttcaggtt cacactaata aaggcctgtc 4380 cttggggaag acttctgttc acatgcgctc cagtgaattt ccctttctgg tcattctcta 4440 ccccagcacg ccccccaccc ccgacccgcc ccacccaccc acctgttcat ttccttctta 4500 gcatgcttca cgatttctaa gttcctgctc atgtgtttaa attgtgagtc tggctcacct 4560 catggcgcgt gctcgtgtgg tgggctctgc tgcagcctca agaccccaca ctgtgctgga 4620 ctcaataaat attgttggac gaaggaatga aacacatgat acaagtgagc aggcagtacc 4680 gggggagctg tggagtgggc actcttacag gtttccatgg cgaaagcggg ggtacagttg 4740 tgttcttttc tttctaaaag gctttctaaa aagccttctg tttaatttct ggaaaagaag 4800 cctaacttgt tcactacata gtcgtccttc ttcctctctg gtaacacttg ttggtctgtg 4860 gaaatactaa tttaatggat cctgaggttc tggaagtact ttgctgtgtt cactcaagaa 4920 tgtgatttga gtatgaaatt ccagccagtt caactgttgt tgcctattaa gaaacctaat 4980 aaagctccac cttctttatc tctgaaagtg aactccctgc tacctttgtg gactgacagc 5040 tttttatagt cacgtgacac agtcaaacat taacttggtg tatcgattgg tttttgccat 5100 atatatatat ataagtagga gagggcgaac ctctggcagg agcaaaggcg ccatggctgt 5160 ggagtcccag ggcggacgcc cacttgtcct gggcctgctg ctgtgtgtgc tgggcccagt 5220 ggtgtcccat gctgggaaga tactgttgat cccagtggat ggcagccact ggctgagcat 5280 gcttggggcc atccagcagc tgcagcagag gggacatgaa atagttgtcc tagcacctga 5340 cgcctcgttg tacatcagag acggagcatt ttacaccttg aagacgtacc ctgtgccatt 5400 ccaaagggag gatgtgaaag agtcttttgt tagtctcggg cataatgttt ttgagaatga 5460 ttctttcctg cagcgtgtga tcaaaacata caagaaaata aaaaaggact ctgctatgct 5520 tttgtctggc tgttcccact tactgcacaa caaggagctc atggcctccc tggcagaaag 5580 cagctttgat gtcatgctga cggacccttt ccttccttgc agccccatcg tggcccagta 5640 cctgtctctg cccactgtat tcttcttgca tgcactgcca tgcagcctgg aatttgaggc 5700 tacccagtgc cccaacccat tctcctacgt gcccaggcct ctctcctctc attcagatca 5760 catgaccttc ctgcagcggg tgaagaacat gctcattgcc ttttcacaga actttctgtg 5820 cgacgtggtt tattccccgt atgcaaccct tgcctcagaa ttccttcaga gagaggtgac 5880 tgtccaggac ctattgagct ctgcatctgt ctggctgttt agaagtgact ttgtgaagga 5940 ttaccctagg cccatcatgc ccaatatggt ttttgttggt ggaatcaact gccttcacca 6000 aaatccacta tcccaggtgt gtattggagt gggactttta catgcgtata ttctttcaga 6060 tgtattactt tggatcgatt aactagcccc agatatatgc tgagcaagca ttctgagata 6120 atttaaaatg ccctcttttg ttaatttttg actcctaggt ttgagtctgt ctttggcatc 6180 atcttctgga tgatttcttg gtatctgaga tttcgggaaa gcattccttg gacattttac 6240 tctgtgtgct ccagtggata gtaatcaatt agaaacaaca agctgttaaa tgccataggc 6300 acagaatgct gggtttgggg caccctgcag aaaactcagt tgaagcctgc accttgccct 6360 ggattcagtc aggcaggcaa tgttcaggac tgatgaaatc attctttgat gatgatagat 6420 cctggaaatg aaagttgcct ttgtgaccct ggttaaagct ccagtttcta aatattctga 6480 taagaagcta aatcctgcag tccgttctct tctaatgagt gaatcaccag acagtcaggt 6540 tctgacatga tacagaaagg ttgtaggttt cattctcaag ctattaggtt tatttttccc 6600 ctacagagtt tgaagtatgc aaaaagtagc attcacatcc tcatcgaaat ctcagcagag 6660 gatagaaaag aacaggagag gctccttcag atggagcgtt agggaattac tctttgagga 6720 ggtgacattt cagagagcgt tcattcactt atcctgcaaa gattggctga ggatctactg 6780 gcagcccagg cacttcccag gtgctgcgtc tggctcccat taaggggact gatatcacct 6840 tcggaggtga ccttatttcc actatacctc caatgtgatt tgtattttat tttttttaat 6900 tttctgtgca ttttccttca tagcacatca aatatggcag ccatttcact tagatagttg 6960 ttgattgtcc gcttcacatc atgagccatg tggggacctg tgtgactttg cattaatcac 7020 atccactgta tgcggcgtcc tcaacacctg ccaatgggtc tgcatgtatt tggcgcccca 7080 taaatctcag cacctaaggc acagaatagg cacccaccga atatgtgtta cattaatgaa 7140 tgagaagaaa ggtgccaacc gaggtctagt taatgggtcg agagtaatcc acaatagctc 7200 tttttagttc tttgtactcc agctattaca taccaatatg tatatagaaa catatgtaaa 7260 attttttggt tgctttttct acaaaataga gtaacagtgt attcccactg cccacttacc 7320 gataatgtca tggatatcac tccagtttta aatgctatta ctttttaaac tatgaaatag 7380 tatttcatgg tacttgtgta ccacagtgta ttctgctgga gatctagtct agttccccac 7440 agaggaacat tacaatttgt attccaggag ttttgttgtt gtgacctcaa acacttcctt 7500 taaaaagata agctattttg tagtttaaaa aacatttgtt ctgtttcttt ctcattcatc 7560 ttttcttaag tattttacac ggtttttttt tttggtcact actgtgaatg tgttattttt 7620 ttgcatttct atctctagct gattatctac tcattactca gctatctcat caaaatattg 7680 attttcataa taaaaaataa taggcagtca tttgctgata aagaaatttt ggtttcttct 7740 cttataaatt ccatgccaaa tatcagggct attgaattta ttagaatctc taaaaacagt 7800 tgaataattc tggcaatagg aaagatgccc gtcttgctgc tattttagtg gaaattgatt 7860 atcatttcat tattttgcat tatgttagcc attgttttct gaacaggctt tattgattta 7920 gataatttcc ttctttgcgt gaggatgttt gtaggagagg caccgaactt tatcagctgc 7980 ctttctggca tttattgata taaccataaa agtctaagtg gtgaactgtg ttgactacat 8040 atttgttgtt gccttgtttg gtgcagtcag gcttaggtgt gaaaatatgt ttttaaattg 8100 taccttttag taacctgttt tgtcttgttg catgttttaa tctgaaattc cactttttgg 8160 atattaatat taccacttct gtattatttt tgtttacatt tccctagcac atctttagta 8220 ctcctttgtc ttcaagcttt cttccttttt aaacaacatg gcactggtat ttttaatcca 8280 gtcaggcagt tgctttaata agtgcatttt gcctatttga atctaacaat taatagattt 8340 gattgtaact ctctcagttt actttatgtt tagttgactt tgccattctc ctttttccgg 8400 atttctactg gttggtcaag ttactgttct tattttctct ttcttccttt gttaactaaa 8460 aatgccactc tgcactacca ttcctcttgt gttgatggtc ctattctcaa tactcttgat 8520 aaaactcctg aactttaaga ataaagataa aacttttatt gcacaaagaa gtccatagag 8580 aaagcacaac ctggcattgg cgtgtctttg gtgtgtctga aggaaaagag atagtggaac 8640 aacattggga gaaaaggaat gaaactcaag aattccaaga tgttcctccc ctgccagggt 8700 aagatagcag tggttcacag acaatcgcaa tgctgggtct gagaaaaata actaaacaga 8760 agattagtga ggaccaaggc ttcgagatgg ccaggagagg aaagcttggg agcagggaag 8820 gttgagatat atgtgggtta ctgggaatgc gtgatggtga agtcacagat gacccacatg 8880 gtgtctaagt gctaaagaag aattctggga aaatgaaatg catttgggaa gggaaaatct 8940 aattaaaagc ctaaactaaa aatacaaaat tcttggtaaa gtttaggagt tatgttaaat 9000 gtctcatttt ggctggtgaa gtctcatcag aacagggaaa ttctctcatt caggggcatc 9060 tcatcttttc tttgaaggga atcaatggtg ggggattgga gtgttatttt cagttaatat 9120 gttgcttcac tctttggtca ttccggtaac tgtgaagtca gggtgaagtt taagggaagc 9180 tttgccaagt aggggatgga cttcaccttt attgagcctc atagtagctg gctcaggtag 9240 gagttggccg tgatgacaac ttctctgcag tttgccctgc gtgaatctcc agatgaactt 9300 ttgtgccatt taaactttcg tgatctcctg ctatttaact tcgaatgttt atggacctgt 9360 gggttcaatt ttgtgtgaat cacatcctgc tgattgctga gtgggcgtgt gggagggtgt 9420 gcctggagga gaacttagac tcggcctttt ccagatgagc ttcagtgtaa gagtgggttt 9480 catgaagagc aaaggtccta ggaaatttaa gtaagccatt taccaacgct cagaagaaag 9540 aacttgaaga gcacttggaa atgagctgtg tctccccaag aaagagggag agaaagaggg 9600 gagagatgtg gtgcagaccc tagggaggaa ggagttcaga aaaaccatcc tcagggtgtt 9660 cttgctacaa accaaaaaat gcagcatggt ggtggggagg atgactctgt cctccctgac 9720 ttttagatga gcccaaggga aaaggcaaag acaaagccct taagagccag aggactcacg 9780 agggcctggg gctggtgaga gtggcgggga gagagggctc accttgggag aaggatggtc 9840 agtgtctggg gctttcctgg tcatgttcca aaataggctt ggcaggagtt ctgctgggaa 9900 aatggggttg gttgaccctg caaaaggtct cctgtgtctc acatttaggg tgaccagcat 9960 cctggcttcc tcaggactgt tcaggtttta gcactgaaca tcacatgtcc tagggaaccc 10020 ctcagtttgg gcaagccctg ccacatcaca caatcatatt agtgccctca gtattctttg 10080 caaacataaa accatagact cagtaatccc attactgggt atatacccca aagaaatata 10140 aattattcta ctataagaca catgcacata tttgtttatt gcagcactat tcacaataac 10200 aaagtcatgg aaccaaccca gatgcccatc aatggtagat tggataaaga aaatgtggta 10260 catatacacc atggaatact atgcagccat aacaaggaat gagatcatat tctttgcaag 10320 gacatggatg aagctggaag ccatcatcct ccacaaacta acacaggaac agaaaatcaa 10380 acaccgcatg ttctcactca taagtgggag ttgaacagtg agaatgcgta gacgcaggga 10440 ggggaacaac acacaccagg gcttgtggcg gggtgagggg tgaggggagg aacttagagg 10500 ataggtcaat aggtgcagca aaccaccatg gcatatgtat cccagaactt caagtaaata 10560 ataataataa taattaataa taataataat aataaataaa cccataaagc catttgagag 10620 attcttgggg gattcattgg accactgaaa atctacagtg agaaaagaat tgccatgttg 10680 atgaaacagg aaaactttcc ttgtccccct cacagagcat gtgacagcgg gaggggctca 10740 ctttctcagt gcgccactgc tcaaacctct aggggagcat acagacgggc aggttgtggg 10800 gctctgacct caccggcagt gtttagaggt ggatgtttac aggctctgaa gcttccaggg 10860 gcgggggtta tggcctttct ttaagttttg ccctctatag tcagcttgtg ttaaccagct 10920 caattacacc ctctaccttg tcgcaaggac agagggcttt ctgtatcctg ggggcttgcc 10980 ttggtgtacc agaagaatcg aatcccacct gggcttggag aatgagtgca aggatttatt 11040 gagtggatgt agctctcagc agatggggga agccagaagg ggatggaatg ggaagggttt 11100 cccctggagt cagaccgctc agtggcccgg gctcggtggc ccgggctcgg tggcctgggc 11160 tctcctccga ctgcctcagc caaactccgc gttgttctgc tggtcagtgg cctgccggtg 11220 cctgttggtg agttcttctc aatgtccagc tgtccttgcg tccctccgct gatgtgctcc 11280 tcccgatgtc cagctacctg tgtgtctgcc tgctagggtc ttggggtttt tataggcaca 11340 tgatgggggc gtggcaggcc agggtggttt tgggaaatga aacatttagg caggaaaaca 11400 aaaatgcctg tcctcaccta ggtccatggg cacaggtctg ggggtggagc cctcgccagg 11460 gaccacaccc tcttctaccc agcacttccc ttccctactt ccatatcatt taaagggacc 11520 acgcccttcc cagctcttcc cttctgtatc actgatgcct tgctctgtgt tctctaagtg 11580 gaattatcac tgtgtgtatg tacaggtgtg tgcatgtgtg tgcatgtacc tgtgcttttc 11640 ttttggaaaa ctagcacatt acctggattt tgcatctcaa ggataattct gtaagcagga 11700 acccttcctc ctttagaagg aagtaaagga gaggaaaatg ctgtaaaact tacatattaa 11760 taatttttta ctctatctca aacacgcatg cctttaatca tagtcttaag aggaagatat 11820 ctaattcata acttactgta tgtagtcatc aaagaatatg agaaaaaatt aactgaaaat 11880 ttttcttctg gctctaggaa tttgaagcct acattaatgc ttctggagaa catggaattg 11940 tggttttctc tttgggatca atggtctcag aaattccaga gaagaaagct atggcaattg 12000 ctgatgcttt gggcaaaatc cctcagacag taagaagatt ctataccatg gcctcatatc 12060 tattttcaca ggagcgctaa tcccagactt ccagcttcca gattaattct cttaattgga 12120 accttagatt tggcttttcc ctgccacttc ccaactatta atccaaaggt tttttttgtt 12180 gttgtggttg ttgtcattgt tttcaatttg actctcaaat actctattaa actatgatcc 12240 accacactca gaagtatcat tttctctaag agactcaaaa gtgtattagg gagaatttat 12300 ttaaaaataa aataaatggg atattgtttc ttcatattaa atagaagtat ttctccaaaa 12360 agctgttggt tagaacactg aatttatgtc ttacatttct gctcttatag ttctgcatcc 12420 acttgtttca ttaagcaaac tttcccttaa agtgcaggaa agtgaaaaaa tcctaagtgc 12480 acagcttgat aaattatcac aaattcacgt agtgcataca cccttgtaac taaacctcca 12540 aaacaagatg ccggaagttg ccagtcctca gaagccttca cagttactga tcctcccact 12600 ctgttaaaga ctgttccttc agaggacccc tgttttctag ttagtatagc agatttgttt 12660 tctaatcata ttatgttctt tctttacgtt ctgctctttt tgcccctccc aggtcctgtg 12720 gcggtacact ggaacccgac catcgaatct tgcgaacaac acgatacttg ttaagtggct 12780 accccaaaac gatctgcttg gtatgttggg cggattggat gtataggtca aaccagggtc 12840 aaattaagaa aatggcttaa gcacagctat tctaaaggat tgttgagctt gaaaatatta 12900 tggccaacat atcctacatt gctttttatc tagtggggta tctcaaccca cattttcttc 12960 tgcaaatttc tgcaagggca tgtgagtaac actgagtctt tggagtgttt tcagaaccta 13020 gatgtgtcca gctgtgaaac tcagagatgt aactgctgac atcctcccta ttttgcatct 13080 caggtcaccc gatgacccgt gcctttatca cccatgctgg ttcccatggt gtttatgaaa 13140 gcatatgcaa tggcgttccc atggtgatga tgcccttgtt tggtgatcag atggacaatg 13200 caaagcgcat ggagactaag ggagctggag tgaccctgaa tgttctggaa atgacttctg 13260 aagatttaga aaatgctcta aaagcagtca tcaatgacaa aaggtaagaa agaagataca 13320 gaagaatact ttggtcatgg cattcatgat aaaattgttt caaatatgaa aacatttacg 13380 tagcatttaa tagcgttgtt tcaaatataa aaacaaatac ataaaaatct ggatttttat 13440 ttcttccttt tttttttttt ttttttttga gatggagtct tgctctgtca cctaggctgg 13500 agtgcagtgg tgcaatcttg gcttactgca acctccacct cccacgttca agcagttctg 13560 cctcagcctc cgtgtagctg ggattacagg tgtccaccac cacgcccggt taatttttgt 13620 attttttagt agagaaaggg tttcaccatg tttgtcaggc tggtcttgaa ctcctgactt 13680 caggtgatcc acctgcctcg gcctgccaaa gtgctgagat tacaggcatg agccagcgcg 13740 tctgacctgg atttataaat aagataattt agaggttatt attcacttta taaaaggatt 13800 ctttagtttc tatataattt atcatataat ttatttagaa ttttatttcc cccattagat 13860 ttaaaactcc aatttacata aaaagttgcc ataatagaca tctgatccat aagtttcctg 13920 cacagaaaga aatactccat tataagaagc atagtatctt taagagaaaa acaactcaaa 13980 tgcttagaag tacagctttt tgcagcactg gaacctgtga gaaattttgt ccatggagtt 14040 tatgaatgaa ggagctataa gatatcacag acaaagtctt agaataagag caaaggaaaa 14100 tttgctcaaa tgtggccctg aaaacgattc aaagggcaaa tgatttctgg attaaagtta 14160 gtatattact gtcaagctca ctggtaatag gcttattaga accttatggg aagaagtggt 14220 ggccagtggt agatttcatc cgacaataga tactgtgtgc atatgtgcgt gtgcgtttgt 14280 gcatgtggct gtgctcatgt gtgggtgcac acgtgtgcat tcatatgcgt gtgtgtgtgt 14340 gtgcgtgtgt ttatgagagt gtccattgct ttctcccatg gttacctcct ttagaaagaa 14400 gcagcagtca ggaagacaga tgtgaagagc tggagcatgt tcagatgaga ggagacggaa 14460 cacggggaca caccagcttg agcaagggac aacaggggag gactgatgac tgacttccca 14520 cctttgaggt gctaatgtgt gtgtggtggc actggataaa agatcaatgt tggctaggca 14580 ccatggcaca cgcctgtagt cccagccact ctggaggcta aggcgggagg attgcttgag 14640 cccagaagtt ggaggctgct atgagccgtg atcatgccac tgcactccag caacctgggc 14700 aacagagtga gaccctgtct caaaaaaaaa aaaaaaaatg aaaagtccac ataacctgag 14760 catcatgtgc ccagagcgtt gggtggtgtg gtcccattcc ttccttccag cggcttcttc 14820 tggccacctc aatgtcagga tgtcctgctc acatatcaat accattaaaa cctgacttct 14880 ttccctgcac tgttgaagct ccttcttgag gctcacatta tggatataat tttgattctt 14940 tcttcagtgg tatagataac tacttgtaac ctaagaacaa cttggtgaaa gtcctctaat 15000 acattatttt ttaaaaaaac acaaatcaat gagctcaact tattaactaa ctttcatcta 15060 ttcatttttg agccatccct gtctgattgt gaatctccat gattccaaca ctctgagctg 15120 gggatagtgc ctacacaaaa taaaaagaag tggaaaattt tcaaacatca gtttatgctg 15180 acaaccaggc cataataggt gctcaattac tattgaatga atgaatgaaa gttctggcca 15240 ggtacggtgg ctcatgcctg tagtcccaac actttgggag gccgaggcag gtggatcact 15300 tgaggttagg agttcgaaac caacctgacc aacatgaaga aaccttatct ctaccaaaaa 15360 aatataaaaa aattacccag gcatggtggt gtatgcctgt aatcccagct atttgggagg 15420 ctgaggcagg aaaatcactt gaacctgaga ggcggaggtt gcagtgagct gagattgtgc 15480 cactccactc cagcctgggc gacagagtga gactccgtct tacttaaaaa aaaaaaaaag 15540 aaggttccaa gaaaattcat cttaaggttt atgtaaaagg aagatgatat ttaacatgat 15600 tcatggccaa gtactaatat tacattataa taatgtttcc aaataacatt atagatatgt 15660 ttaaagacag tgtattaggc tgttcttgca ttgctgtaaa gaaataccca agactgggta 15720 atttataaag aaaagaggtt tcattggctc gtgtttctgc aggctgtaca ggaagcttag 15780 tgctgacatc acttggctgc cgggggaacc tcagggagct tttactcatg gcagaaggca 15840 atgcgggagc ttgcatgtca catggcaaaa gcaggagcga gagagagttg ggggggaagg 15900 tgccacacac tttttaatga ccggctctca caataactca tgaaaactca ctatcaggaa 15960 gacagcacta aagcacaagg gatccgaccc catgatccaa acacctccca ccaggcccca 16020 tctccagcac tggggattac aattcaacat gagatctgag tgtggacaaa tatccaaact 16080 gtatcagtca acagcgatca taattagtcc tgaataggag tgcctttttt tttctttctt 16140 ctcccttttc ttttctactt cctcctcctt ttccctctcc tcttcaatct cctcttcatt 16200 cctgtagcac caagggttga agcacctaac ccgttttgga ttgagatgtt ctgattgggc 16260 aatgaacact gtccagaata aacagaaatc cattttgcac taagtggctg cacagaccct 16320 gcctcatgct aaatctagca cccagatagt ttaatgtttc aatgactgaa ttacaaatat 16380 atcatcacct tggatttggc acttacaaat ggctgttaat ttggccagag gtggttgttt 16440 acaacttcaa ataggagact attcataatt tctgacgtga cattttcctt tctttatttt 16500 actgtatgaa aatataatga aatttctcac aaaatatcac taaaaagaaa agaagaagag 16560 taggaagcaa ggttaaaata tttctaaaat ataattttgg tctttctttt tctcccttcc 16620 ttcctccgtc cctctctcct ttcctctctc cctccctccc tccctccctt cctcctttcc 16680 ttgcttcctt ccctccttct cttccttctt tttcaagaga tcaataacat ttattaagaa 16740 taagtttctt aattataacc tttcaggtga taatagtaac acagcctggg caacacaata 16800 agaccttgtt tctacaaaaa atttaaaaat tggccagaca tagtggtgca tgactaattc 16860 cagctactct ggaggctgag gcaggaggat ggcttgagcc caggagttgg aggctgcagt 16920 tagccatgct tgtgccacta cactccagcc cgggcaacag ggcaagactc tgtatctaaa 16980 aacaacaaca acaacaataa tagaaacagg tttcctttcc caagtttgga aaatctggta 17040 gtcttcttaa gcagccatga gcataaagag aggattgttc ataccacagg tgttccaggc 17100 ataacgaaac tgtctttgtg tttagttaca aggagaacat catgcgcctc tccagccttc 17160 acaaggaccg cccggtggag ccgctggacc tggccgtgtt ctgggtggag tttgtgatga 17220 ggcacaaggg cgcgccacac ctgcgccccg cagcccacga cctcacctgg taccagtacc 17280 attccttgga cgtgattggt ttcctcttgg ccgtcgtgct gacagtggcc ttcatcacct 17340 ttaaatgttg tgcttatggc taccggaaat gcttggggaa aaaagggcga gttaagaaag 17400 cccacaaatc caagacccat tgagaagtgg gtgggaaata aggtaaaatt ttgaaccatt 17460 ccctagtcat ttccaaactt gaa 17483 2 20 DNA Artificial Sequence Description of Artificial Sequence Synthetic Primer 2 gtcacgtgac acagtcaaac 20 3 19 DNA Artificial Sequence Description of Artificial Sequence Synthetic Primer 3 tttgctcctg ccagaggtt 19 4 20 DNA Artificial Sequence Description of Artificial Sequence Synthetic Primer 4 ctggggataa acatgggatg 20 5 20 DNA Artificial Sequence Description of Artificial Sequence Synthetic Primer 5 caccaccact tctggaacct 20 6 22 DNA Artificial Sequence Description of Artificial Sequence Synthetic Primer 6 acctctagtt acataacctg aa 22 7 20 DNA Artificial Sequence Description of Artificial Sequence Synthetic Primer 7 aataaacccg acctcaccac 20 8 19 DNA Artificial Sequence Description of Artificial Sequence Synthetic Primer 8 gccaagggta gagttcagt 19 9 18 DNA Artificial Sequence Description of Artificial Sequence Synthetic Primer 9 gaccccagcc cacctgtc 18 10 20 DNA Artificial Sequence Description of Artificial Sequence Synthetic Primer 10 atgctgggaa gatactgttg 20 11 20 DNA Artificial Sequence Description of Artificial Sequence Synthetic Primer 11 tttggtgaag gcagttgatt 20 12 22 DNA Artificial Sequence Description of Artificial Sequence Synthetic Primer 12 gtcttcaagg tgtaaaatgc tc 22 13 20 DNA Artificial Sequence Description of Artificial Sequence Synthetic Primer 13 gtgcgacgtg gtttattccc 20 

What is claimed is:
 1. A method for evaluating the risk of irinotecan toxicity in a patient comprising determining the presence of a polymorphism in one or both UGT1A1 genes of the patient, wherein the polymorphism is in linkage disequilibrium with a UGT1A1 TA repeat.
 2. The method of claim 1, further comprising amplifying from a nucleic acid sample all or part of 5′ flanking region of one or both UGT1A1 genes to obtain amplification products and analyzing the amplification products for the presence or absence of a polymorphism.
 3. The method of claim 1, wherein the polymorphism is at nucleotide position −3440, −3401, −3279, −3177, −3175, or −3156 from the UGT1A1 gene transcriptional start site.
 4. The method of claim 1, wherein the number of TA repeats is 5, 6, 7, or 8 TA repeats.
 5. The method of claim 1, wherein the polymorphism is a −3440C>A polymorphism.
 6. The method of claim 1, wherein the polymorphism is a −3401T>C polymorphism.
 7. The method of claim 1, wherein the polymorphism is a −3279G>T polymorphism.
 8. The method of claim 1, wherein the polymorphism is a −3177C>G polymorphism.
 9. The method of claim 1, wherein the polymorphism is a −3175A>G polymorphism.
 10. The method of claim 1, wherein the polymorphism is a −3156G>A polymorphism.
 11. The method of claim 1, wherein determining the presence of a polymorphism in one or both UGT1A1 genes of the patient comprises determining the nucleotide sequence at position −3156 in one or both genes.
 12. The method of claim 11, further comprising classifying the UGT1A1 activity level in the patient, whereby identification of a guanine residue indicates the patient does not have a low level of activity.
 13. The method of claim 1 1, further comprising determining the nucleotide sequence at position −3156 of a second UGT1A1 gene in the patient.
 14. The method of claim 11, further comprising administering irinotecan to the patient if a guanine nucleotide is found at position −3516.
 15. The method of claim 1, further comprising analyzing a glucuronidation rate associated with the polymorphism.
 16. The method of claim 1, further comprising optimizing a dose of irinotecan for administration to the patient.
 17. The method according to claim 1, wherein determining the presence of a polymorphism of a UGT1A1 gene or genes is performed by a hybridization assay.
 18. The method according to claim 1, wherein determining the presence of a polymorphism of a UGT1A1 gene or genes is performed by a sequencing or microsequencing assay.
 19. The method according to claim 1, wherein determining the presence of a polymorphism of a UGT1A1 gene or genes is performed by an allele-specific amplification assay.
 20. The method of claim 1, further comprising administering to the patient irinotecan.
 21. The method of claim 20, further comprising administering to the patient a second agent to reduce excretion of an active irinotecan species through the bile.
 22. A method for evaluating the risk of irinotecan toxicity in a patient comprising: determining the nucleotide sequence at position −3156 in one UGT1A1 gene of the patient.
 23. The method of claim 22, further comprising classifying the UGT1A1 activity level in the patient, whereby identification of a guanine residue indicates the patient does not have a low level of activity.
 24. The method of claim 22, further comprising determining the nucleotide sequence at position −3156 of a second UGT1A1 gene in the patient.
 25. The method of claim 22, further comprising administering irinotecan to the patient if a guanine nucleotide is found at position −3516.
 26. A kit for evaluating the risk of irinotecan toxicity in a patient comprising an oligonucleotide primer to amplify a 5′ flanking region of a UGT1A1 gene or genes.
 27. The kit of claim 26, further comprising amplification primers of the UGT1A1 gene, wherein the amplification primers amplify haplotype tag SNPs.
 28. The kit of claim 27, wherein the amplification primers amplify a polymorphism at nucleotide position of −3440, −3401, −3279, −3177, −3175, or −3156 from the UGT1A1 gene transcriptional start site
 29. The kit of claim 27, wherein the amplification primers are comprised in multi-well assay plate.
 30. The kit of claim 26, further comprising specific hybridization probes.
 31. The kit of claim 30, wherein the specific hybridization probes detect polymorphisms at nucleotide position −3440, −3401, −3279, −3177, −3175, or −3156 from the UGT1A1 gene transcriptional start site.
 32. The kit of claim 31, wherein the the specific hybridization probes identify whether there is a guanine nucleotide at position −3156 upstream from a UGT1A1 gene transcriptional start site.
 33. The kit of claim 31, wherein the specific hybridization probes are comprised in an oligonucleotide array or microarray. 